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A SIMPLIFIED ELECTROMAGNETIC THEORY OF THE INDUCTION MOTOR, 
USING THE CONCEPT OF WAVE IMPEDANCE 


By Professor A. L. CULLEN, Ph.D., B.Sc., and T. H. BARTON, Ph.D., B.Eng., Associate Members. 


(The paper was first received 8th July, and in revised form 30th October, 1957. 


SUMMARY 


Electromagnetic field theory is applied to the analysis of performance 
of an induction-motor rotor when it is exposed to a rotating magnetic 
field of constant amplitude. An idealized model of the rotor is taken 
and the analysis is simplified by the application of the concept of wave 
‘impedance. The model rotor is capable of simulating the effects of 
tooth-top and zigzag leakage fluxes, but end effects are neglected. The 
equations of performance thus obtained are shown to be identical with 
those obtained by the conventional theory when the latter includes 
skin effect. The model motor used by Mishkin (Quarterly Journal of 
Mechanics and Applied Mathematics, 1954, 7, p. 472) is shown to give 
results widely different from those obtained in practice, since it neglects 
the zigzag leakage fluxes. The analogy between the tangential force 
acting on the rotor and radiation pressure is indicated, and it is shown 
that a change of variable familiar in microwave theory leads to the 
theory of the variable-speed induction motor developed by Williams 
and Laithwaite [Proceedings I.E.E., Paper No. 2097 U, June, 1956 
(404, A, p. 102)]. 


LIST OF PRINCIPAL SYMBOLS 


a = Depth of tooth-top region of rotor. 
b = Depth of rotor bars. 
b, = Width of rotor bars. 
B,, = Maximum magnetic flux density. 
d = Width of opening between tops of teeth on rotor. 
E, = Electric field strength in equivalent uniform rotor. 
f = Force per unit volume on equivalent uniform rotor. 
F = Force per unit area of rotor surface. 
H,, H, = Magnetic field components in equivalent uniform 
j rotor. 
J, = Current density in equivalent uniform rotor. 
DP; = Slot pitch. 
R = Rotor-bar resistance. 
s— Slip. 
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Xo = Equivalent standstill rotor-bar reactance. 
x, y = Rectangular co-ordinates. 
Zo1, Zo2 = Wave impedances in tooth-top and rotor-bar regions 
respectively. 
« = Skin effect parameter: a—! = skin depth. 
B = Phase-change coefficient of flux wave. 
Y1> Y2 = Propagation coefficients looking into rotor through 
tooth-top and rotor-bar regions respectively. 
Lo = Absolute permeability of free space. 
bt, = Relative permeability. 
Hex, [ys Lex» by = Absolute permeabilities of equivalent uniform 
anisotropic media. 


(1) INTRODUCTION 


The induction motor, like all other electrical devices, must obey 
the laws of electricity and magnetism as formulated in Maxwell’s 
equations of the electromagnetic field. However, a practical 
induction motor is so complicated in its geometry that an exact 
solution of these equations is quite impracticable, even if the 
effects of non-linearity and hysteresis in the iron are ignored. 

A working theory of the induction motor has, however, been 
developed by making use of a.c. circuit theory. The parameters 
introduced into this theory can be evaluated theoretically by 
individual field calculations, usually of an approximate character. 

In a rather indirect sense, this theory can be regarded as an 
approximate solution of Maxwell’s equations, for the a.c. circuit 
equations themselves follow as a consequence of Maxwell’s 
equations, subject to certain definable limitations. 

Recently, however, an approximate solution of Maxwell’s 
equations for an idealized model of an induction motor has been 
obtained by Mishkin.! His model is linear and replaces the 
practical toothed structure of stator and rotor by continuous 
though inhomogeneous regions, having appropriately averaged 
resistivities, and different permeabilities in directions parallel to, 
and perpendicular to, the air-gap. The effect of end windings is 
taken into account in the theory. The analysis, though approxi- 
mate, leads to results which are exceedingly complicated, so that 
the connection between Mishkin’s equations and the usual 
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performance equations of the induction motor are not readily 
discernible. His analysis ignores the large part played by tooth- 
top and zigzag leakage flux in a practical machine. This omis- 
sion leads to performance equations in which a principal com- 
ponent of leakage reactance is that of the rotor-slot, which, as is 
well known, is exceedingly dependent upon the slip. As a conse- 
quence, a current locus which diverges considerably from the 
circle diagram is given in a particular numerical example, and 
Mishkin attributes the divergence to skin effect. In fact, skin 
effect would be negligible over a practicable speed range in the 
example taken. Such divergence as is observed in practice is 
primarily attributable to saturation of the leakage-flux paths. 

The purpose of the present paper is to present a more ele- 
mentary electromagnetic theory of the induction motor in which, 
for simplicity, end effects are neglected, but which pays due atten- 
tion to the part played by tooth-top leakage fluxes. The use of 
a simpler model facilitates comparison with conventional theory 
and moreover makes possible an exact solution of the problem. 

The solution of the field equations is simplified, and given a 
clear physical significance, by making use in the analysis of the 
concept of wave impedance, which has for many years been a 
corner-stone of microwave theory, but which, so far as the 
authors are aware, has not previously been invoked in the theory 
of electrical machinery. 

A further point of interest is that a simple change of variable, 
familiar in microwave circuit theory, leads at once to the theory 
of the brushless variable-speed induction motor of Williams and 
Laithwaite.3 

It is shown, furthermore, that the theory of the squirrel-cage 
induction motor is closely related to the theory of radiation 
pressure and to the theory of electromagnetic surface waves. 
The force acting on the rotor of such a machine can in fact be 
legitimately described as the radiation stress due to a surface 
wave incident on the rotor at a complex angle of incidence of the 
form 7/2 + js, where ys is much greater than unity. 


(2) CIRCUIT ANALYSIS 


The purpose of this Section is to state briefly the results of the 
circuit method of analysis of the induction motor, for comparison 
with the field-analysis results which are obtained in the following 
Section. 

We consider a motor in which the flux density in the air-gap is 
a sinusoidal travelling wave of maximum value B,,. The time- 
average tangential force F per unit area of rotor surface is given by 

__ woTB?, sR 
Op Ra a patel tsa) 
where w is the stator frequency; s, the slip; 7, the pole pitch; 
Ps the slot pitch; R, the resistance per unit length of the rotor 
bars; and Xo, the rotor-bar reactance per unit length at standstill. 

If R and Xp can be regarded as constants, eqn. (1) gives the 
relationship between torque and slip for an induction motor 
quoted in elementary textbooks. In fact, however, R and Xo 
depend on the rotor frequency because of skin effect. Formulae 
aliowing for skin effect are given by Alger.4 For our case and 
our notation they can be written 


a sinh 2ab + sin 2ab 


ye Gog Coste 2OD —'COS 2b. 7 ae eee Bes (2) 


a 1 o 
Xo = Wolo G as 


sinh 2ab — sin 2ab 
)- @ 


SWol4q Tob, cosh 2b — cos 2ab 
where gg is the conductivity of the rotor bars and 


ay /S@ol40%0 
=, Meal «figs, ees) 


The rest of the notation is given in Fig. 1. 
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Fig. 1.—Slotted linear model motor. 


The term w9{49(a/d) in eqn. (3) is the reactance due to tooth-top 
leakage, which is dependent solely on the total conductor current 
and is unaffected by a non-uniform current density. 

Egns. (2) and (3) can, of course, be expanded in powers of «b, 
and the familiar formulae for R and Xp neglecting skin effect 
can easily be derived in this way. To a first approximation, we 
find that 

1 


Oobb, 


Woo G at 7) 


These formulae are applicable if 2«b < 1. 


R~ (2a) 


l2 


Xo (3a) 


(3) FIELD ANALYSIS 


The first step in applying the field method of analysis is to 
formulate a model of an induction motor which is simple enough 
to make such an analysis possible. 


(3.1) Formulation of Idealized Problem 


We first ignore the curvature of the air-gap and take a linearized 
cross-section of the actual machine as shown in Fig. 1. The — 
effect of stator slots is also ignored. To facilitate analysis, the 
toothed structure of Fig. 1 is replaced by the continuous structure 
comprising regions 1 and 2 of Fig. 2. It is assumed that the 


USSR 


Fig. 2.—Uniform linear model motor. 


rotor core, region 3, and the stator are of infinite permeability 
and zero conductivity. Region 1 represents the air-gap and 
tooth tops of Fig. 1, and therefore its conductivity is taken as 
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zero, whilst region 2 represents the conductor region of Fig. 1 
and has a mean conductivity of 


oy) — ae 5) 
al BSC otis BRAS i ES) 

The effective permeability, on the other hand, is more difficult 
to evaluate. It is clear, in the first place, that the average reluc- 
tance of a path parallel to the rotor surface and passing through 
the slots is much greater than that in a direction perpendicular 
to the rotor surface. Thus the continuous region 2 in Fig. 2, 
which corresponds to the slotted region in Fig. 1, must be 
anisotropic in its magnetic properties. Similar remarks apply 
to region 1 in Fig. 2, which corresponds to the tooth-top region 
in Fig. 1. 

An exact determination of the effective permeabilities is very 
difficult, but if fringing is neglected, simple magnetic-circuit ideas 
can be applied to give very crude approximations. 

For region 1: 


Py = Mor 
ag feoeip (6) 
By — B;Ho 


For region 2: 


Px = 


| 
is 
r—) 

& 


(7) 
Py & [yg ——— 


In these equations pu, is the relative permeability of the iron. 
This will in practice be of the order of 4000, and has been taken 
‘as infinite elsewhere in formulating eqns. (6) and (7). It is also 
taken as infinite in region 3, which represents the unslotted rotor 
core. 

(3.2) Field Analysis 


Maxwell’s equations in region 2 are: 


dE 
oe EP ae: 8 
jw pH, > (8) 

: dE, 
—jwp,H, = — <7 os (9) 

ele OMNEG. 

Se 10 
Le Sei ey a 


In these equations a time factor ¢/°! is assumed, and w = swo 
is the angular frequency of the rotor currents. The co-ordinates 
x and y are measured in a co-ordinate system fixed to the rotor. 
Derivatives with respect to z have been set equal to zero since 
end effects are neglected, and current flow is assumed to be 
wholly in the z-direction, so that H, has been put equal to zero. 
Displacement currents have also been neglected. 

All field quantities vary with x as e~/®*, where B = 2n/A = m7; 
i, the wavelength of the travelling wave, is equal to twice the pole 
pitch. Subsituting —jP for d/dx in (9) and (10) and eliminating 
H, between these two equations, we get 


2 
eye 
Why oy 

If a solution of eqns. (8) and (11) for E, can be found the 
problem is solved, for H, can easily be found from E, using 
eqn. (9). Thus (8) and (11) are the basic equations. Their 
solution is easily found by making use of the wave-impedance 
concept and employing a transmission-line analogue. Corre- 
“ponding quantities in the analogue are most easily seen by 
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setting out the relevant equations side by side, as Booker has 
done in dealing with other electromagnetic problems.? 


Induction motor Transmission line 


0£F, A 
—-—= JopxHs 


oy 


32 
_ x es ( ae B )zz 
oy JO fly 


There is clearly a one-to-one correspondence between the 
following quantities 

E,>V ] 

H,—> TI | 


jop, > Z (12) 


2 
o + B > Y | 
Jy 
Since Z and Y represent respectively the series impedance and 
shunt admittance per unit length of the transmission line, we can 
construct for the induction motor a transmission line equivalent 
which represents exactly the relationship between E, and H,, in 
region 2. An elementary section of this line is shown in Fig. 3. 


jeopsy 
2 
Sy 


jop,oy 


Fig. 3.—Element of equivalent transmission line. 


The characteristic wave impedance and the propagation coefficient 
for the y-direction can be written down by inspection 


JOpx _ JOpx 
oe 2 
702 o + —_— p He 
JOpPy (13) 
IE | 

= (oy ties|| Cor == 

ve E Le ( jon, 
The suffix ‘2’ refers to region 2 of the induction-motor model 


in Fig. 2. oak 
By exactly the same argument, we can set up a transmission- 
line model for region 1. The characteristic wave impedance and 
the propagation coefficient can be obtained from eqn. (13) by 
replacing , by p,, fy by w,, and putting o = 0. This gives 


; toa CS) 


334 


In order to complete the analogy, we must consider boundary 
conditions. At the interface between regions 1 and 2, E, and 
H,, must be continuous. Thus, in the analogue, V and J must 
be continuous at the junction of the two dissimilar transmission 
lines representing regions 1 and 2. This continuity follows from 
Kirchhoff’s laws. 

In region 3, H, must be zero everywhere since the permeability 
is infinite. In particular, H, =O at the boundary between 
regions 3 and 2. In the analogue we must have J = 0 at the 
corresponding point. The transmission line for region 2 is 
therefore open-circuited at the corresponding end. The open- 
circuit condition can alternatively be obtained by noting that, 
since pp = © in region 3, the associated characteristic wave 
impedance is infinite, and this is the ‘termination’ of region 2. 

Fig. 4 shows the transmission-line analogue of the interaction 


0.C 
Zz 
ro) § %, 02 
a) 
on b 
wD Ey 
iv 
—_- 
E. 
Hx. v2 


REGION 1 
° Oe 
m 
SS 


Fig. 4.—Equivalent transmission line system. 


region of the induction motor, regions 1 and 2 of Fig. 2. The 
sending-end and receiving-end voltages of a transmission line 
terminated by an impedance Z, are related as follows: 


Vs = Vr (cosh yl + 2 sinh yl) 


Using the equivalences set out in (12) and referring to Fig. 4, 
the relationship between E,, and E,, can be written 
= Zo} 4 
E,, = E,.( cosh yya + Z, sinh y,a (15) 
2 
Here, Z, is the input wave impedance of region 2, which, by 
analogy with an open-circuited transmission line, is 


Z, = Zo2 coth y2b (16) 


(17) 


Making use of the well-known formula for the voltage distribu- 
tion on an open-circuited transmission line, we can express the 
electric field strength E, at any distance € from the interface 
between regions 2 and 3 in terms of E,, as follows: 


Z 
Minus eee (cosh ya + =! tanh Y2b sinh via) 
22 


cosh yé 
E — a’ 
2 En cosh yb eo 
Combining (17) and (18), we have 
cosh 
BE, =, vas (19) 


Zoi. 
cosh y,a cosh y2b + z sinh y,a sinh yb 
02 
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We have now found the form of the field distribution in region 2, 
but the absolute magnitude is not yet determined. We take as 
datum the maximum value of the magnetic flux density B,, in 
the air-gap. Since relativistic effects can be ignored at the low 
velocities involved, this quantity is unaltered by transformation 
to co-ordinates fixed to the stator, and can be equated to B,, 
introduced in the circuit analysis. 
The air-gap flux density can be written 


By = Be” 8 


(20) 


Since the normal component of magnetic flux density is con- 
tinuous at an interface, eqn. (20) also applies just inside 
region 1. But in region 1 eqn. (9) applies if pw, is replaced 
by p,. Also B, = w,H,. Hence 


By Sue eee 
Combining (19) and (21) we have 


cosh y2€é 
B a Zot 3 : 
( osh y,a cosh y2b + Zz. sinh y,a sinh yb 
02 


ia mas wB,, 


(22) 


This is the exact solution for the rather highly idealized model 
we have chosen to consider. 

To calculate the tangential force on the rotor, we start with 
the force per unit volume exerted in the x-direction by the 
magnetic field on the current. This is 


= —J,By 


If J, and B, are interpreted as complex amplitudes rather than 
instantaneous values, the time-average force density is 
f=" — GLIB Se eee 


Using eqn. (21) and remembering that J, = cE,, we have 
(24) } 


Substituting (22) in (24) and remembering that |B,,| = B,,, 
we get 


cosh y€ cosh y¥é 
3 (25) 


Zi 
cosh y,a cosh y2b + a sinh y,a sinh yb 
02 


To find the force per unit area of rotor surface we must integrate 
(25) with respect to € between the limits 0 and b. The result is 


pee owB;, _y2 sinh y2b cosh y¥b — y* sinh y¥b cosh yb 
x 2p 


Ze 
(= 799) cosh ya cosh y2b + Zit sinh yya sinh yb 
(26) 


This equation can be put into a form more convenient for 
practical application. In the first place, for all reasonable values 
of y,; and a, it is perfectly adequate to write 


Ri 
v14 


Secondly, y2 can be approximated as 


ya = View, = 4 (Va + 7) 


cosh y,a 


l2 


(27) 


2 


sinh y,a 
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But, from eqns. (6) and (7), we see that [xT = [gOo. 
Thus y, can be expressed in terms of the parameter « defined by 
eqn. (4), as follows, 

Vo = ody) (28) 
Substituting (27) and (28) in (26), putting 8 = 7/7 and simplifying, 
we get 
_ WoT Bn Fobss 
Mesyi, We? 


sinh 2«b + sin 2ab 
2n? (cosh 2ab — cos 2«b) 
+ 2n (sinh 2b — sin 2«b) 
+ (cosh 2«b + cos 2«b) 


aab, 


d 


It is not immediately obvious from eqn. (29) that the field-theory 
result agrees with that obtained by circuit analysis. However, if 
eqns. (2) and (3) are substituted in (1), the resulting formula can 
be reduced exactly to eqn. (29). 

It is at first sight remarkable that this exact agreement should 
be found between the two formulae, for approximations have 
been made in deriving both, and the approximations are not 
obviously equivalent. However, a deeper examination of the 
problem shows the agreement is not unreasonable. The tradi- 
tional analysis is founded upon the superposition theorem, in 
that the fields produced by the rotor currents are considered 
separately from the externally applied field which produces these 
currents. Further there is the axiom, usually unstated, that the 
tangential force on the rotor due to the interaction of its currents 
and their own field is zero. This follows from the magnetic 
uniformity of the rotor and stator in the tangential direction. 
Hence it follows that the only relevant force is that between the 
radial component of the externally applied field and the conductor 
cross-section. Because of the high permeability of the teeth it 
can be assumed that the flux density is independent of depth, and 
hence the force per conductor per unit width is equal to the 
product of flux density and total conductor current, and this in 
fact is the manner of derivation of eqn. (1). 

In applying the field theory the position is complicated since 
the resultant of the externally applied and the self-generated fields 
is considered. However, using the previous axiom it is evident 
that the answer must be identical with that obtained by con- 
sidering only the externally applied component. 

The term f?/(jwp1,) in eqn. (13) determines the decay of the 
externally applied field in the rotor slot region. . This is easily 
seen by putting o = 0 in that equation; there are then no rotor 
currents and no magnetic field other than the externally applied 
field. Neglecting £?/jw., in comparison with o, as in eqn. (28), 
is therefore equivalent to neglecting the decay of the externally 
applied field in the rotor slot region. The approximation (27) is 
equivalent to neglecting the decay of the externally applied field 
in the tooth-top region. Hence the exact equivalence of the 
two results. 


x 


(29) 


where 


(30) 


(3.3) Numerical Example 


To illustrate the formulae numerically we have taken a machine 
with the following rotor dimensions: 


Tooth top depth a=1cm 
Slot depth b=2cm 
Slot width ‘ oo De = lier 
Slot pitch .. a ap — ol 
Slot opening .. : vad O'S: cin 


The apparently excessive tooth top depth has been chosen to 
vaake some allowance for the zigzag reactance which would 
eccur in a normal machine. In Fig. 5, torque/slip curves at 
constant air-gap flux density have been plotted. Curve (a) is the 
esult obtained from the field theory or from the usual eqn. (1) 
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TORQUE (ARBITRARY UNITS) 


SLIP 


Fig. 5.—Torque/slip characteristics. 


(a) Double-layer theory. 
(b) Neglecting skin effect. 
(c) Single-layer theory. 


together with (2) and (3). Curve (4) is obtained from (1) using 
(2a) and (3a); i.e. it neglects skin effect. 

A deep-bar rotor has been selected to emphasize the well- 
known importance of skin effect in such cases; the standstill 
torque is approximately doubled. 


(3.4) Comparison with Mishkin’s Analysis 


In Mishkin’s paper! the rotor is represented by a much simpler 
model than that used in the present paper, though he has, of 
course, taken end effects into account and dealt with the stator 
as wellas the rotor. Without going into so much detail, however, 
we shall show that his model of the rotor is too simple to represent 
the facts accurately, and in fact it greatly over-exaggerates skin 
effect. 

Mishkin’s analysis does not take proper account of the shape 
of the slot or of the fact that the rotor bars are embedded in the 
rotor and are separated from the rotor surface by a non-conduct- 
ing (ideally) mixture of iron and air. Whilst we have taken 
account of this fact by using a double-region model for the 
electromagnetic field analysis with one region (No. 2) to represent 
the conductor region and a second region (No. 1) to represent 
the tooth-top region, Mishkin has used a single region to represent 
both. Our formulae can easily be applied to a single-region 
model by putting a = 0. The effective conductivity of the single 
layer is easily obtained, and the average values /Z, and jz, can be 
obtained from pu, and w,, and w, and py, by combining reluc- 
tances for horizontal and vertical flux paths in parallel and in 
series respectively. When this is done, the single-layer theory 
yields the result shown as curve (c) in Fig. 5. It is clear that 
skin-effect is much exaggerated. 

It should be noted that the Mishkin results can be made to 
fit the case of a practical machine if the permeability of the air-gap 
in the x-direction is increased so as to allow for the effects of 
tooth-top and zigzag leakage fluxes. 


(4) FIELD THEORY OF THE BRUSHLESS VARIABLE- 
SPEED INDUCTION MOTOR 
A simplified theory of the very elegant motor described by 
Williams and Laithwaite* can be derived by the methods used 
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in this paper. We shall not go into details, but merely indicate 
the main features of interest. In the first place, it is well known 
in microwave theory that certain types of discontinuity in wave- 
guides can be treated theoretically by first examining the corre- 
sponding discontinuity in a parallel-plate line carrying a TEM 
wave, and then transforming the solution back to the waveguide 
case by simple substitutions. 

For example,> the susceptance of a capacitive diaphragm in a 
rectangular dominant-mode waveguide can be obtained from the 
corresponding strip-line formula by replacing A by dg: This 
transformation arises from the fact that a 3-dimensional problem 
with a harmonic field variation in say the z-direction leads to a 
wave equation which is formally identical with a 2-dimensional 
wave equation if the wavelength is properly chosen. Physically, 
the waveguide mode can be constructed from obliquely-travelling 
plane waves, the velocity of each of which measured normal to its 
wavefront has the constant value c. The phase velocity of the 
composite wave is, however, increased to csec 8, where @ is the 
angle between the plane-wave direction and the axis of the wave- 
guide, and sec 6 = A,/A. 

Exactly the same argument can be applied to the induction 
motor, and the speed increase associated with obliquely travelling 
air-gap flux waves follows at once. 


(5) CONCLUSIONS 


It has been shown that the concept of wave impedance can be 
used in developing a simple electromagnetic field theory of the 
induction motor which leads to identically the same formula for 
the torque-slip relationship as the conventional circuit analysis 
with skin-effect corrections. 

Thus the same equation may be regarded either from a circuit 
point of view as an approximate solution of the actual problem, 
or from a field point of view as the exact solution of an idealized 
problem. 

An inadequacy in Mishkin’s earlier electromagnetic theory of 
the induction motor has emerged, and the need for a 2-region 
model of the rotor has been demonstrated. 

The fundamental connection between the action of an induc- 
tion motor and the phenomenon of radiation pressure should be 
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mentioned. The closest analogy is with the tangential stress due 
to radiation obliquely incident on an absorbing surface. This 
effect has been observed by Barlow and Poynting when light falls 
obliquely on a blackened surface. 

In the induction motor, the incident wave is not a simple plane 
wave but a surface wave. The principle is the same, however. 
In both cases the incident field sets up currents which are acted 
upon by the magnetic field of the incident wave to produce a 
mechanical force. That the forces are so different in magnitude 
in the two cases tends to conceal the underlying unity; this is 
essentially due to the enormous difference in wave velocity in 
the two cases. 

For the induction motor at standstill F = P/v, where P is the 
power delivered to the rotor and v is the velocity of the flux 
wave. The radiation stress due to light, on the other hand, is 
of the order of P/c, where c is the velocity of light. Bearing in 
mind that in the optical case P could hardly exceed 1 watt, the 
difference in the magnitudes of the forces is easily understandable. 

Finally, having established a link between microwaves and 
induction motors, it is suggested that some of the theoretical 
methods developed for microwaves might usefully be examined 
by the machine designer. In particular, the variational methods 
for dealing with discontinuities, and the various techniques for 
handling periodic structures of various kinds, might find new 
application in the field of electrical machinery. 
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SUMMARY 


In very weak fields the eddy-current losses in a lamination may be 
calculated exactly but the observed loss is always somewhat greater. 
Most of this extra loss varies with frequency and field in the same way 
as eddy-current loss, and if a quantity 7 equal to the ratio of the 
observed to calculated loss is defined it is found that it increases rapidly 
with decreasing thickness of sheet. It is suggested that this effect 
arises from neglect of the true mechanism of magnetization, and it is 
shown that the classical calculation of eddy-current loss is invalid 
when the distance between domain walls becomes comparable with 
the sheet thickness. Theoretical predictions seem to be in good 
agreement with observations made by Richards, Walker and Lynch. 


LIST OF PRINCIPAL SYMBOLS 
c = Velocity of light, cm/sec. 
d = Thickness of ferromagnetic sheet, cm. 
e = Electronic charge. 
g = Landé splitting factor. 
m = Mass of electron, g. 
m = Effective oscillating mass of domain wall, g. 
n = Odd integer. 
DP, g = Numerical constants. 
H——imenSec, 
« = Restoring force on a domain wall, dynes/cm?. 
B = Domain-wall viscous damping parameter. 
B. = Domain-wall viscous damping parameter due to micro- 
eddy currents. 
8, = Domain-wall viscous damping parameter due to spin 
relaxation. 
5 = Loss angle, rad. 
5. = Loss angle due to micro-eddy currents, rad. 
6, = Loss angle due to spin-relaxation wall damping, rad. 
dp = Loss angle due to rotational spin relaxation, rad. 
7” = Ratio of measured to calculated eddy-current loss. 
A = Spin relaxation damping constant, sec!. 
ps = Total magnetic permeability, gauss/oersted. 
}4w = Magnetic permeability due to wall movement, 
gauss/oersted. 
/tr = Magnetic permeability due 
gauss/oersted. 
p = Electrical resistivity, e.m.u. 
X w = Magnetic susceptibility due to wall movement. 
Xp = Magnetic susceptibility due to domain rotation, i.e. 
rotational susceptibility. 
Xx, = Rotational susceptibility at angular frequency w. 
Xp.o = Rotational susceptibility at zero frequency. 
w= Angular frequency of magnetic field and magnetiza- 
tion, rad/sec. : 
A = Exchange energy per unit volume, ergs/cm?. 
H = Magnetic field, oersteds. 
M, = Saturation intensity of magnetization. 
K = Magneto-crystalline anisotropy constant, ergs/cm?, 
] = Half-thickness of ferromagnetic domain, cm. 
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(1) INTRODUCTION 


In recent years considerable progress has been made in pro- 
ducing magnetic alloys possessing very high initial permeability 
and low hysteresis loss. The most important of these are 
developments of binary alloys of iron and nickel containing 
about 78% Ni. Addition of copper and/or molybdenum 
enables initial permeabilities of over 100000 to be attained. 

These alloys, hereafter referred to simply as high-permeability 
alloys, have been developed for use as cores for small transformers 
where lightness and compactness are the overriding requirements. 
In order to design transformers which will operate satisfactorily 
at high frequencies the practice adopted is the usual one of 
making the cores with laminations sufficiently thin so that, at 
the operative frequency, the skin depth is greater than the thick- 
ness of the lamination. For frequencies higher than a few 
kilocycles per second, very thin laminations have to be used, 
and the production of sheets of the required thickness presents a 
difficult task. It is therefore all the more perturbing to find that, 
when the very considerable technical difficulties have been over- 
come and the sheet thickness has been adequately reduced 
(usually by cold-rolling), the properties of the material in thin 
sheet form are by no means as desirable as they were originally. 
Most important from the designer’s point of view is that the 
eddy-current losses* in very thin sheets are always greater than 
those calculated from the standard classical formulae. The paper 
is concerned with the characteristics of this anomalous loss. 
A possible explanation is given which seems to be borne out by 
the experimental results available. 


(2) CHARACTERISTICS OF THE ANOMALOUS LOSS 


It has become the practice to describe the anomalous loss in 
terms of the ratio of the measured loss to that calculated from 
classical eddy-current formulae. It is unfortunate that this ratio 
has been given different symbols by various authors. Richards, 
Walker and Lynch! use the symbol g, but in order to avoid 
confusion with the Landé splitting factor, the symbol 7, which 
is used by Feldtkeller, is used in this paper. 7 is a meaningful 
quantity only if the frequency is specified. However, it appears 
that it is substantially independent of frequency below about 
100kc/s. In other words, the anomalous loss behaves like an 
eddy-current loss as far as its frequency dependence is concerned. 
This rules out immediately any explanation in terms of hysteresis 
and losses of the Jordan type? since they give rise to loss angles 
independent of frequency, and so, if they were present, 7 would 
tend to infinity at zero frequency. 

Anomalous losses of the type under consideration seem to 
occur in all materials, although most information exists for high- 
permeability alloys of the Permalloy type. This is probably 
solely due to the fact that these are the alloys which have been 
rolled thinnest. It thus appears that the anomalous eddy- 
current losses are not a prerogative of high-permeability materials, 
and there is no close correlation between 7 and p. 

The most striking and characteristic feature of 7 is its marked 


* In the paper the term ‘eddy-current loss’ is used in a restricted sense to mean 
the loss measured in a sinusoidally varying field of such small amplitude that the 
permeability is sensibly constant and hysteresis loss negligible. Similarly, the term 
‘permeability’ is taken to mean initial permeability throughout. 
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dependence on sheet thickness. It increases sharply with 
decreasing thickness, and for a 3-micron Permalloy strip Abgrall 
and Epelboin? found that 7 equals 18. Measurements by 
Richards, Walker and Lynch show that, for a Mo—Permalloy of 
composition Ni 77%, Cu 5%, Mo 4% and Fe 14%, the loss 
ratio can be expressed as 7 — 1 = ka~3/* where k is a constant 
and a is the strip thickness. Moreover, for values of a greater 
than about 200 microns, 7 = 1:0 within the limits of experi- 
mental error. This is important since it indicates that there is 
nothing seriously wrong with the classical eddy-current calcula- 
tion for sheets whose thickness is, by ordinary engineering 
standards, quite small. 

An explanation often given for these facts is the presence of 
surface layers whose permeability is less than that of the bulk 
material. These could arise quite naturally from surface closure 
domains or could be due to impurities existing only on the 
surface. Evidence for the existence of such layers has been put 
forward by Peterson and Wrathall,* who inferred their presence 
from measured eddy-current losses, and more directly by 
Epelboin,> who measured the initial permeability of a Permalloy 
tape at each stage in the removal of successive layers from the 
surface by electropolishing. Feldtkeller® showed that low- 
permeability surface layers would give rise to loss ratios greater 
than unity, but the maximum value of 7 predicted by his analysis 
is about 1-6, which is much too low to account for the observed 
values. Furthermore Richards, Walker and Lynch consider 
that the low-permeability surface layers encountered by Peterson 
and Wrathall are definitely attributable to impurities. These 
layers do not exist in special high-purity alloys. The anomalous 
losses, however, still persist. It thus appears that Feldtkeller’s 
mechanism, if present at all, can only account for part of the 
excess loss. 

A more realistic approach to a somewhat similar problem is 
that of Aspden,’ who calculates the power loss to be expected 
in a sheet containing a single-domain wall in the plane of the 
sheet. The results are compared with the power loss measured 
at times at which the induction is instantaneously zero. The good 
agreement obtained by Aspden is, in the author’s opinion, to be 
expected, since this model of the domain structure is probably 
quite good for the conditions under which the experimental results 
were obtained, i.e. large peak induction, measured at a field near 
the coercive force where large domains of reverse magnetization 
are expanding rapidly. There is, however, no reason to expect 
Aspden’s model to be appropriate to the situation in the very 
low fields considered here. 

It seems that the only way of explaining the anomalous eddy- 
current loss in such a way that the loss ratio 7 is independent of 
frequency (or very nearly so) is through the agency of a relaxation 
mechanism. In this paper the suggestion is put forward that 
such a relaxation mechanism appears quite naturally if a more 
realistic approach is made to eddy-current losses, taking into 
account what is known about the domain structure existing in 
ferromagnetic materials. 


(3) THEORY 

It is generally believed that, in low fields, magnetization takes 
place by the reversible displacement of walls separating regions 
magnetized in directions determined by the crystal anisotropy of 
the material. Considering two such regions separated by a 
domain boundary, it is clear that the net change in magnetiza- 
tion is produced by very large changes in local magnetization 
in the immediate neighbourhood of the domain boundary. At 
points far removed from the boundary the material is unaffected 
by an external field (apart from any rotational permeability 
which is neglected for the time being). In other words, one 
must attribute a very large local permeability to the domain wall 
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since the permeability at large distances from it is unity. A 
moving domain wall giving rise to large local changes in mag- 
netization will set up micro-eddy currents which will act in such 
a way as to oppose the wall movement. This effect was con- 
sidered in some detail by Williams, Shockley and Kittel,’ who 
showed that micro-eddy currents thus produced act so as to 
introduce a damping term in the equation of motion of the wall. 
In addition, a damping of the wall movement will occur owing 
to inherent damping of the spins in the domain wall. This 
damping, first discussed by Landau and Lifshitz? is of obscure 
origin. The mechanism is the same as that which gives rise to 
the finite width of the ferromagnetic resonance line, and for 
ferrites (in which eddy-current damping of all kinds is negligible) 
the damping parameter obtained from domain-wall experiments 
is in good agreement with that found from the resonance 
experiments. !° 

To take account of domain-wall damping, consider a model 
such as is depicted in Fig. 1, with a 90° wall lying in the zx-plane 


Fig. 1.—Cross-section of part of a ferromagnetic sheet. 


The sheet is supposed to be of infinite extent in the x and y directions. 
The domain wall at A is moving from left to right. 
The dotted lines show the approximate distribution of eddy currents. 


at A. Let the depth of the domains be d, which is taken parallel 
to the z-axis. If a field H is applied in the xy-plane into the 
plane of the diagram, the wall at A will tend to move to the 
right. In an alternating field Hoe’! the equation of motion of 
the wall will be 


my + By + ay = pHM, cos Oe . . . (1) 


where p equals 4/2 for a 90° wall and 2 for a 180° wall. The 
effects of wall inertia do not begin to make themselves felt until 
frequencies of the order of some megacycles per second have 
been reached,!! and hereafter the first term will be neglected. 
Eqn. (1) becomes 


By + ay = pHM, cosbe* |. . |. . Qa 
which has as its solution 
HM, cos 0 1 ' 
ees Jot, ~ eee 


4 1 + jeoBlo* 


If the distance between successive domain walls is 2/ this dis- 
placement will produce a change of magnetization in the direction 
of the field equal to pM,y cos 0/21. The susceptibility is then 
p?M2 1 
30) LcigexB]aaeeea iene ie 4) 


in which cos? @ has been replaced by its average value of one-third. 
The frequency variation is thus of a relaxation type with a 
relaxation frequency equal to a/8. The loss angle for wall 
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movement, 6, is given by tan 8 = w/a and is thus independent 
of the type of wall. 

As previously mentioned the damping constant, B, is in 
general made up of two parts, 8, caused by micro-eddy currents 
and f, due to intrinsic damping of the spins. The calculation 
of 8. has been carried out by Williams, Shockley and Kittel® for 
a single wall and by Polivanov.!2 The latter’s results are to be 
preferred, since he considers an array of walls transverse to the 
plane of the sheet. He thus obtains a result which takes into 
account the fact that the z-components of the currents set up 
by neighbouring moving walls tend to cancel out between the 
walls, and therefore the result converges to a limiting case as 
the distance between neighbouring domain walls is decreased. 
The current distribution is shown diagrammatically in Fig. 1. 
Polivanov obtains the following result for the loss tangent 
produced by micro-eddy currents: 


32ued? 1, 
71” p hae 


nil 


tan 6, = n? coth —- at t(D) 
where &’ indicates a summation over odd values of #. This 
result holds for an array of domain walls which extend through 
the sheet perpendicular to the plane of the sheet and along the 
direction of the applied field. It also assumes that the domain 
walls remain plane, so that the following calculation is valid 
only for the case in which the skin depth is greater than the 
sheet thickness and the field penetrates the sheet without sig- 
nificant change of amplitude. In an actual polycrystalline sheet 
the domain walls will not necessarily extend through the sheet 
but will be bounded by the crystal grains. Each moving domain 
wall will set up its own system of eddy currents, and although 
these may cancel out in the interior of the sheet, there will be no 
cancellation at the surfaces. Thus the model of transverse 
domains extending through the sheet is likely to be a good one. 
It remains to show that eqn. (5) reduces to the classical value for 
a sheet of thickness d and uniform permeability 2 when the 
distance between the domain walls becomes zero. 


1 
Now Lim. oy n~>coth as = — > n=4 
1/d—>0 d d T 
= 773/96 
/and so as //d > 0, 
mpd” 
tan 0, = 
3p 


which is just the classical value. 
(o4 
The damping parameter 6, = = tané6,. From eqn. (4) 


2p?*7M?2 
= ——— 


if 1 and w = 0, and so 
ul tp > 

64p*M 2d &’ n-3 coth ae 
(6) 


Be = 


| It is necessary to add that part of the damping caused by intrinsic 
‘damping of the spin system within the wall. Galt!? calculates 


‘this spin relaxation term to be 


gA\K |12 
B, i id 2y?All2 


377p 


|i which qg is a constant approximately equal to 2 for a 180° 
‘wall and unity for a 90° wall. The total loss tangent is therefore 


'g ven by w(B, + BD 


a 


oe gas = 
a 
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and therefore the loss ratio 7, i.e. the ratio of the actual loss to 
that calculated from classical eddy-current theory, is 


‘eal 
d: Vagal Kl 
2y2All2 


64p2M 2d &’' n-3 coth 
9pl 


1 Opa Sa? 


37p 


This is an exact expression for the case in which magnetization 
occurs solely by domain-wall displacements, i.e. for all except 
very-high-permeability materials in which the rotational per- 
meability may form an appreciable factor of the whole in small 
fields. The necessary modification for these materials is made 
at the end of Section 4. 

In the absence of spin relaxation effects, eqn. (7) reduces to 
the simple form 


nil 


96 1 
a COL === 
n—3 cot 7 


This expression is plotted in Fig. 2. Empirically, by plotting 
7 — | against //d it is found that, for values of //d less than 


20 
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Fig. 2.—Variation of the loss ratio 7 as a function of ratio J/d. 


This curve represents only the eddy-current losses and does not include the spin 
relaxation contribution. 


unity, eqn. (8) can be simulated fairly accurately by an expression 
of the form 
y= 1+ kd 


where k is a constant and the exponent r lies between 1-5 and 
1-7. Although detailed comparison with experiment is deferred 
until a later Section, simple order-of-magnitude estimates of the 
quantities in eqn. (7) show quite generally that the spin relaxation 
term is not important unless d is very small—about 10 microns. 
The variation of 7 with d predicted by eqn. (7) is then approxi- 
mately as d~" for large values of d. As d is decreased the sum 
in the numerator becomes &’n~3, independent of d, and so a 
d-! law obtains. If d is decreased further the spin-relaxation 
term takes control and is manifested as a d~* dependence. 
It is probable that this would not occur until d was reduced to 
a few microns. This assumes that / is constant. In fact, there 
will certainly be a distribution of values of / for a given (poly- 
crystalline) specimen. Moreover it is by no means certain that / 
will be independent of d. Recent work by Martin!‘ indicates 
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that the most likely relation between these two quantities is that 
lis proportional to d'/?._ These effects will tend to smooth out 
the complicated variation of 7 with d and will also make 7 
decrease less rapidly with d than is predicted by eqn. (8). 
Eqn. (7) will then lead to an approximate law of the form 


7) = 1 + kde, 


for values of d greater than a certain quantity and in which 
Tie 6: 


(4) COMPARISON WITH EXPERIMENT 


The most comprehensive set of measurements with which to 
compare this result are those of Richards, Walker and Lynch! 
on an alloy containing 77% Ni, 14% Fe, 5% Cu and 4% Mo. 
This alloy has, after suitable purification, initial permeabilities 
ranging from 20 to 40000 and a resistivity of 5:7 x 10*e.m.u. 
Measurements made in this laboratory give M, = 600 gauss. 
There is a considerable body of evidence to support the view 
that in soft magnetic materials the movement of 90° walls makes 
the greatest contribution to the initial permeability. The most 
telling evidence for this is that in these materials the highest 
initial permeability is developed in those alloys for which the 
magnetostriction constant in the direction of easy magnetization 
is a minimum. Since movement of a 180° wall is not accom- 
panied by magnetostriction it is difficult to see why zero magneto- 
striction should be so essential to high permeability if 180° walls 
make an appreciable contribution. Accordingly it is assumed 
that p = 4/2. For this alloy A = 2 x 108sec—! (see Reference 
15), y = ge/2mc = 1-76 x 10’ if g = 2. The values of K and 
A are not known, but no great error will be incurred by putting 
K = 500 ergs/cm? and A = 2 x 10~° ergs/cm, the latter being 
the value for pure iron. Substitution then gives B, ~ 107. 
Similarly B, becomes 40d, taking the sum equal to unity. 
This implies that, for a sheet thickness of 2-5 x 10~-4cm, the 
spin relaxation and micro-eddy current terms are equal. Strips 
of such thickness do not appear to be available at present. For 
a thickness of 25 microns the spin-relaxation effects contribute 
only about one-tenth of the total loss, and for thicknesses of 
50 microns and more they are negligible. 

All the terms in eqn. (7) are now known except J. The quantity 
21 is the distance between successive domain walls. Nothing is 
known about this quantity for an alloy of the composition under 
discussion, but one may reasonably expect that 2/ should not be 
greater than the crystallite dimensions since domain walls 
crossing grain boundaries are not normally envisaged in 
unoriented material. In view of the present lack of knowledge 
of the factors which determine the domain sizes in polycrystalline 
metals it is felt to be extremely dangerous to use a priori values 
for 2/. (For a criticism of this procedure see Reference 16.) 
Instead the empirical relation of Reference 1 is used, namely 


Mele kOe oii se oe O) 


where k = 90 if d is in microns, and eqn. (7) is ‘fitted’ to the 
value of 7 given by this expression for a thickness at which the 
spin-relaxation effects are negligible. Putting d = 50 microns, 
eqn. (9) gives 7 = 1:25. This leads to a value for / of 13-5 
microns, so that the domain width is 27 microns. For a strip 
of this thickness the grain size is stated! to be 45-50 microns so 
that there are then about two domain walls per crystallite. 
This seems to be quite a reasonable result. In the absence of any 
evidence to the contrary, this value of / is used for all values of d. 
This is probably not strictly true but it has the advantage of 
simplicity. 

One final modification must now be made. 


Eqn. (7) applies 
only to wall movements. 


There will also be a contribution to 
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the initial permeability from domain vector rotations. These 
give rise to a permeability 
4a M2 
VR ae ee 


For K = 500 ergs/cm? as used here, zp = 3000. This is nearly 
one-tenth of the initial permeability found for these alloys. To 
a first approximation, relaxation losses associated with the 
rotational permeability may be neglected, and so xz will give 
rise to purely classical eddy-current losses for which 7 = 1 
The expected loss ratio 7’ is then 


(10) 
& a 
where 7) is obtained from eqn. (7). Values of 7’ calculated from 
eqns. (7) and (10) are shown in Fig. 3 as a function of d. The 
results are in good agreement with experiment. Perhaps the 
most surprising result is that eqns. (7) and (10) predict a depen- 
dence of 7’ on thickness which is almost exactly an inverse three- 
halves power law. Eqn. (9) is shown in Fig. 3 for comparison. 
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Fig. 3.—Variation of loss ratio 7 with sheet thickness for an alloy 
containing 77% Ni, 14% Fe, 5% Cu, 4% Mo. 
The curve is calculated from eqn. (10). 


The straight line is the empirical relation [eqn. (9)] deduced by Richards et al.! from 
a large number of experimental observations. 


Actually there is some evidence that, in this alloy, the grain size 
increases with the strip thickness, and it is very possible that, 
provided that the crystallites are less than a certain size, the 
domain size is a constant fraction of the crystallite dimensions. 
This is in keeping with the concept of ‘closed flux shell’ intro- 
duced by Martius!” and suggested by Kittel.!8 The effect would 
be to make 7’ vary less rapidly with d than is given by eqn. (7): 


(5) DISCUSSION 
Apart from the spin-relaxation term the factors which influence 
7 are purely the geometrical ones of domain size and specimen 
thickness, so that there is no reason why large values of 7 should 
be confined to one group of magnetic materials. Broadly 
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speaking, eddy-current losses will be greater than those calculated 
on a classical basis, whenever the distance between domain walls 
becomes comparable with the thickness of the sheet. Very little 
is known about domain widths in random polycrystals, but it 
seems unlikely that they would be as large as the thickness of 
ordinary hot-rolled silicon-iron transformer sheet. Thus one 
would expect the eddy-current losses measured in very low fields 
to be equal to the values obtained from classical formulae. In 
fact, measurements made in this laboratory show that 7 = 1 
within the limits of experimental error for types of 0-013in 
thick commercial transformer sheet so far investigated. 

The Situation is somewhat different in the case of grain- 
oriented material. Here there is a considerable likelihood of 
domain walls crossing grain boundaries, and the distance between 
walls may be quite large. These conditions have been revealed 
by the powder-pattern technique. Unfortunately, the patterns 
become indistinct in zero applied field, which is just the region 
of interest. Measurements by Bates and Hart!? on a crystallite 
in a disc of grain-oriented silicon iron indicate domain widths 
which extrapolate roughly to 250 microns in zero field. Since 
the thickness of this disc was 0:32mm, JI/d ~ 0-4, which, from 
eqn. (8), implies that 7 ~ 1-6. However, these measurements 
applied only to one crystallite, and the others may have con- 
tained much smaller domains. Thus, even in grain-oriented 
silicon iron, 7 ~ 1 for the usual sheet thicknesses. 

It is nevertheless to be expected that, in grain- or domain- 
oriented materials in which comparatively few domain walls are 
believed to exist, high values of 7 will set in at higher thicknesses 
than for random material, and this seems to be borne out by the 
measurements of Parkin?® on a 50/50 nickel-iron alloy where 
grain orientation increased » by about 50%, and on a 65/36 
nickel-iron in which a magnetic anneal increased 7 from 1-18 
to 2-20. Similar results have been obtained by Buckley et al.?! 

In the previous Section it was suggested that, if the domain 
size were a constant fraction of the grain size, 7 would be pro- 
portional to the latter. Some measurements made by Parkin?? 
on a Mumetal-type alloy seem to support this view and are 
shown in Fig. 4, from which it is evident that 7 = 1 when the 
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Fig. 4.—Observed variation of loss ratio 7 and permeability with grain 
diameter for a Mumetal-type alloy—Parkin.2° 


zrain diameter is reduced to zero. At present, however, there is 
‘nsufficient evidence to show that this is a typical result. 

There next arises the possibility of a correlation between 7 
and p. For the spin-relaxation part, the situation is simple. 
An most inclusion models |K |"/2 oc w—!, and so 7 varies as p—'. 
*t is not so easy to decide in connection with the eddy-current 
damping term. From eqn. (7) it would appear that, since 7 is 

: By NAD = e 
roportional to / and pp = 2p*7M3/3al, then y © pb’. How 
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ever, this ignores the fact that « is not independent of /. Much 
more likely, in the author’s view, is that « increases as / decreases, 
and so the variation of 7 with js depends on the manner in which 
« varies with /. Thus, although for a given set of experimental 
conditions, i.e. heat treatment, etc., one may observe a definite 
correlation between 7 and yp, it is not possible a priori to predict 
the form of this correlation. 

These considerations also apply to the possible temperature 
dependence of 7. Broadly speaking, one would expect 7 to be 
almost temperature independent, but if 7 does, in fact, vary 
with p, a temperature variation in one will be reflected in the 
other. Genuine temperature variations in 7 seem to be possible 
only through the intervention of the spin-relaxation term. For 
most materials K increases rapidly at low temperatures, and, 
in addition, A increases at high temperatures,?° 73 and so the 
spin relaxation should become more important at both high and 
low temperatures. 

It is perhaps pertinent to ask whether, for a given thickness 
of sheet, it is possible to reduce 7 in any way. One way might 
be to reduce the domain size by decreasing the grain size. The 
danger is that, if the reduction in grain size increases « more 
rapidly than it decreases the domain size, the permeability will 
fall. This is, in fact, the case with the measurements of Parkin 
shown in Fig. 4, which indicate a serious drop in permeability 
when the grain diameter becomes less than about 40 microns. 
A more hopeful way of reducing 7 would seem to be to reduce 
the anisotropy more nearly to zero. This would have the effect 
of increasing the domain-wall width, since this is proportional 
to K—'/2, until ultimately the whole material would be composed 
of walls and a pure rotational permeability would result. In 
either case the permeability would still be subject to inherent 
damping of the spin in much the same way as the spin system is 
damped in a ferromagnetic resonance experiment. Kéittel** 
obtained the following expression for the frequency dependence 
of the rotational susceptibility (equal to a good approximation 
to -4r/4z for the high permeabilities considered here): 


Wes wh + B(ie +3) 


Xr.0 A\2 iN 

9 oo? + (%) +2joy, 
in which wo = y(2K/M,) and X’ is an average rotational sus- 
ceptibility approximately equal to M?/3K. For frequencies low 
compared with wo, this gives rise to the following loss tangent: 


oh 1 


tan Or — 7 
MOTE. 


Rotational processes produce a uniform change in magnetiza- 
tion for which the classical eddy-current formulae should be 
correct and so the rotational relaxation gives 


— 27pA 
7 1672M 22d? 


For the alloy considered in Section 4,7 — 1 = 2 x 10-8 x d-?, 
so that 7 = 3 when d= 1 micron. The smallness of this value, 
especially for large values of d, is the justification for neglecting 
rotational losses in eqn. (10). It also indicates that, if domain 
walls can be removed and a sufficiently high permeability obtained 
from rotations alone, 7 would be much reduced. However, in 
order that 4 = 40000, K would have to be no greater than 
40 ergs/cm?. In an alloy this would probably be difficult, since, 
in a typical nickel-iron alloy, it requires a precision in the nickel 
content of +0:01%. Local variations in composition might 
change K by this amount from one point to another through 
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the material unless the degree of homogeneity were unusually 
good, 


(6) CONCLUSIONS 


It is concluded that, in any ferromagnetic material subject to 
an alternating magnetic field of sufficiently small amplitude so 
that hysteresis loss is negligible, the measured losses will always 
be greater than those calculated on the assumption of a uniform 
permeability. This discrepancy should exist in all materials, 
not merely high-permeability alloys, and it becomes more serious 
as the thickness of the sheet is reduced. The ratio 7 may be 
expected to vary with d according to an approximate law of the 
form 7 = 1 + kd~’, in which r may itself be a function of d 
but always lies between 1 and 2. The effect is caused by 
properties inherent in a ferromagnetic body and can never be 
entirely eradicated. It is possible, in principle, to reduce it by 
increasing the number of domain walls, i.e. by reducing the 
average domain size. Practically, the only way of doing this is 
to reduce the grain size of the material. Since such a reduction 
would almost inevitably be accompanied by a decrease in per- 
meability some compromise would have to be reached in any 
practical arrangement. 
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SUMMARY 

A theoretical analysis of certain aerial problems is carried out with 
the help of the idea of a spectrum of plane waves introduced by Booker 
and Clemmow. The reciprocity theorem, which relates the behaviour 
of an aerial used for reception to the properties of the same aerial as 
a transmitter, is used to derive an expression for the power received 
by one aerial as a result of transmission from a second aerial at any 
distance from the first. It is shown that under the conditions used in 
aerial measurements the size of the receiving aerial can influence the 
errors in measured radiation patterns and power gains. In particular, 
the side-lobe levels obtained when too small a separation between the 
aerials is used can either be larger or smaller than the levels for the 
true radiation pattern. This result differs from that predicted by 
diffraction theory, which suggests that side-lobe levels measured at too 
small a distance are always worse than those in the true radiation 
pattern. Similarly, when the size of the receiving aerial is taken into 
account, errors in gain measurements are shown to be larger than those 
predicted by diffraction theory. 


LIST OF PRINCIPAL SYMBOLS 


a = Aperture length. 
x, y, Zz = Cartesian co-ordinates. 
r, 0, @ = Spherical polar co-ordinates. 
i, j, K = Unit vectors in directions of x-, y- and z-axes. 
i,, ig, ig = Unit vectors in r, @ and ¢ directions. 
Eo, fo = Absolute permittivity and permeability of free 
space. 
Zo, Yo = Wave impedance and admittance of a plane wave in 
free space. 
k = Free-space plane-wave phase coefficient (=27/)). 
S; = sin 6 cos ¢. 


S> = sin 0 sin ¢. 
C= c0s.0: 

Cp = Cos Op; 

So = sin). 


F(S,, S>) = Plane-wave spectrum radiated by an aerial. 
_G(6, d) = Power gain of an aerial for the direction 0, d. 
A.g(0, &) = Effective receiving area of an aerial for the direc- 
tion 0, d. 
Br, Br = Half-power beam widths of receiving and trans- 
mitting aerials. 
D =Complex amplitude of signal in receiving-aerial 
feeder. 
L = log,2. 
p = 4L|Bf — jkr. 
” = arc tan (krBR/4L). 


(1) INTRODUCTION 


In measurements of the radiation pattern or power gain of 
an aerial, errors can arise if the test site is insufficiently large. 
The calculation of these errors is usually based on classical 
diffraction theory and leads to the conclusion that the distance 
between the aerial being tested and the fixed aerial must exceed 
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the Rayleigh range. In such calculations the size of the fixed 
aerial is neglected. The object of the paper is to examine the 
effect of this on the errors, and the method of calculation uses 
the concept of an angular spectrum of plane waves, introduced 
by Booker and Clemmow.! This not only provides a direct 
derivation of the results desired, but enables us to relate the 
problems arising in aerial-pattern measurements to those in the 
determination of the frequency spectrum of a time waveform. 


(2) THE ANGULAR SPECTRUM OF PLANE WAVES 


The basic principle in the method developed by Booker and 
Clemmow is that any electromagnetic field can be represented 
by the addition of plane waves, including the possibility that 
some of these waves may be inhomogeneous and of the type more 
commonly described as surface waves. In radiation problems, 
the amplitudes of these waves can be calculated if either the 
electric field or the magnetic field is known over a suitably 
chosen aperture plane; this aperture field can usually be calculated 
by optical methods relatively easily from the geometry of the 
aerial and a knowledge of the feed radiation pattern. The 
aperture distribution is then expressed as a Fourier integral, and 
the radiated field can be calculated by relating the component 
exponential terms in the integral to plane waves. The details 
of the calculation are described by Booker and Clemmow for 
2-dimensional problems, and the method is readily extended to 
3-dimensional problems. 

Suppose, for example, that the electric field is linearly polarized 
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Fig. 1.—Radiating aperture, showing the co-ordinate systems used in 
the analysis: the aperture lies in the plane z = 0 and may be of 
infinite extent. 


in the aperture plane z = 0 (Fig. 1), the direction of this field 
being parallel to the x-axis. Then 


1 foe) ee) ; ‘ 
Exlx, 7,0) = 55] | FOSS» exp (—ikSyx — jkSzy)d5,dS; 


(1) 


where F(S,, S>) is the Fourier transform of E,(x, y, 0), the aper- 
ture-plane electric field. Furthermore, the Fourier transform 
can be calculated from E,(x, y, 0) by the inverse relation 


F(Si, Ss) =| [ EGs,», 0) exp (ikSyx + ikSyp)dxdy . Q) 
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Now any plane wave travelling in a direction specified by the 
polar angles @ and ¢ of Fig. 1 has an x-component of electric field 
given by 


E(x, y, z) = Eexp [—jk(x sin 8 cos + y sin @ sin + z Cos 6)] 
(3) 
E being an amplitude constant. When z = 0, the variation with 


x and y is of the same form as that in the integrand of eqn. (1), 
provided that 


S; = sin 6 cos ¢ and S, = sin @ sin ¢ (4) 


We can therefore regard eqn. (1) as stating that the aperture field 
arises from a collection of plane waves travelling in all possible 
directions, the amplitude of the wave in the direction given by 
S;, Sp being F(S,, S,). Each of these waves varies with z in the 
manner specified by eqn. (3), so that we find the radiated field 
to be 


Ex(,952) = 5a] _[ FOSt Sp) exp [“Jk(Syx + Sey 
ae + Cz)]dS,dS, 
Ca = 57 spi 


(5) 
(6) 


When @ and ¢ are real angles, the numerical values of S; and Sy 
are such that (S? + $3) 1, C then being real; the positive 
value of the square root in eqn. (6) is used. The Fourier repre- 
sentation, however, requires that S; and S, can assume any real 
values, so that C may become imaginary; if the root in eqn. (6) 
is now selected to make C a negative imaginary quantity, the 
field decays exponentially with increasing positive values of z. 
A wave with a progressive phase change parallel to a surface, 
here the plane z = 0, and an exponential decay perpendicular to 
this surface is usually called a surface wave, but can be regarded 
as a simple extension of the ordinary plane wave. 

Expressions similar in form to eqn. (5) can be obtained for the 
other field components from Maxwell’s equations by an equiva- 
lent generalization of the equations corresponding to (3). The 
component £,, is everywhere zero. Expressing the fields vec- 
torially we find that 


in which 


E@ y=) | (4 — SWF, Sy exp [Sx + Soy 
4+ C2)]dS,dSIC . (1 


Y, ive) co , ; 
He %2=58) | [SiS + 0 SDj — CK] 
F(S;, S») exp [—jk (Six + Spy + C2]dS,dS,/C . (8) 


where Yo = (€9/4o)!/? is the free-space plane-wave admittance. 

It may be noted that, when the aperture field has no y-com- 
ponent of electric field, this component vanishes everywhere. 
Corresponding results can be derived when the aperture electric 
field has no x-component, and the most general case can always 
be resolved into a sum of the two linearly polarized cases. 

The field at a point sufficiently distant from the aperture plane 
can be calculated approximately by a double application of the 
stationary phase method to the integrals in eqns. (7) and (8): 
again this is a simple extension of the results given by Booker 
and Clemmow. If the distant point is specified by the spherical 
polar co-ordinates (r, 0, 4), then 


E(r, 0, 6) = jee oe (cos dig — sin ¢ cos Bis) 
F (sin 6 cos ¢, sin 6 sin d) 
H(r, 0, ) = Yoi, x E(r, 0, d) 


(9) 
(10) 
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From eqn. (9) we see that the dominant contribution to the 
variation of the field with direction arises from the function F, 
which can therefore be interpreted as the amplitude radiation 
pattern. Restrictions on the interpretation of the plane-wave 
spectrum as a radiation pattern are the same as in the 2-dimen- 
sional problem, namely that the field is being considered at a 
distance from the aperture which is large compared with the 
wavelength and with the dimensions of the aperture plane over 
which there is appreciable illumination. 

Eqn. (9) is similar to the result obtained by using the 
Huyghens-Kirchhoff integral method of physical optics when 
approximations are made appropriate to the distant point 
being in the Fraunhofer region. For example, when converted 
to the present notation, Silver’s result? is 
21.0, = [tbe Hp 


eos gig — sin dig) 
F (sin 8 cos ¢, sin @ sin g) . 


which differs from eqn. (9) only in the angular factors multiplying 
the function F. For small values of 8, cos @ can be taken as 
unity and then the two expressions are identical. The different 
angular factors, namely the extra 4(1 ++ cos 6) in Silver’s expres- 
sion and the cos @ appearing before the vector i, in eqn. (9), may 
be referred to as ‘obliquity factors’. 

The two solutions differ, although they are apparently based 
on identical approximations, namely the assumption that kr = 1. 
The most significant difference is that, whereas the Booker- 
Clemmow method relies only on a knowledge of the electric field 
in the aperture plane, the Kirchhoff method requires that both 
the electric and magnetic fields should be specified. These fields 
are not necessarily consistent, and to allow for this, additional 
line distributions of electric and magnetic charge are introduced 
along the boundary of the aperture, which is taken as finite in 
extent. A simple relation between the electric and magnetic 
fields, usually equivalent to assuming that the field in the aperture 
is part of a plane wave, is then postulated. It is doubtful whether 
this relation can ever be satisfied, so that the discrepancy between 
eqns. (9) and (11) probably arises from the incompatibility of the 
electric and magnetic aperture fields used in the derivation of 
eqn. (11). The difference between the two expressions is very 
slight, however, and in practical applications to narrow-beam 


aerials, @ can usually be assumed to be small enough to make — 


cos # unity when, as mentioned above, the two expressions agree. 
In this case the magnitude of the electric field strength is directly 


proportional to |F|, which can therefore be regarded as the 
radiation pattern. 


(3) APPLICATION OF THE ANGULAR SPECTRUM TO 
RECEIVING AERIALS 


The idea of an angular spectrum of plane waves is of con- 


siderable help in calculating the power transferred to a receiver _ 


by an aerial when the wave incident on the aerial has a compli- 
cated form. The incident wave can be expressed as a spectrum 
of plane waves, the signal at the receiver being determined by 
summing the contributions of the individual plane waves. A 
knowledge of the behaviour of the aerial when illuminated by a 
single plane wave is required, and this can be deduced from the 
radiation pattern of the aerial when used as a transmitter, 
coupled with the reciprocity theorem. 


When the reciprocity theorem of circuit theory is applied to 
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aerials it can be shown that the radiation patterns of the aerial 


are identical whether it is used as a transmitter or a receiver. 


For the present analysis we require a slightly more general result 


from which the amplitude and phase in the aerial feeder can be 
calculated when a plane wave is incident on the aerial from any 
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arbitrary direction. This can be derived from the Lorentz 
reciprocity theorem for electromagnetic fields, and details will 
be given in a separate paper. 

Consider the arrangement shown in Fig. 2, in which a feeder— 
either a waveguide or a transmission line—is connected to an 
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AERIAL 
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Fig. 2.—Arrangement of aerial and waveguide feed used for the 
derivation of the reciprocity theorem. 


aerial, and select some reference point P in the feeder. Suppose 
that the impedance of the line is normalized, so that a wave of 
unit amplitude in the line carries unit power, and refer all phases 
to P. Ifa unit wave of zero phase at P travels towards the aerial, 
radiation will occur, and the corresponding radiation field will 
be of the form given by eqns. (9) and (10). To emphasize that 
these now correspond to a unit-amplitude wave we will attach 
the suffix 1 to the function F. 

_ When the aerial is used as a receiver, a plane wave will be 
mecident on the aerial from some arbitrary direction. We will 
retain our restriction that the aerial radiates waves with no 
y-component of electric field and choose the incident wave 
similarly. This will be of the form 


E(x, y, Z) = E(C'i + S{k) exp jk(Sjx + Sjy + C’z) . (12) 
(x, y, Z) = YoE{S)S3i — [1 — (S))7]j + C’Sk} exp jk(Syx 
+ Ssy + C’z) (13) 


where E is a complex quantity specifying the amplitude and 
phase of the incoming wave, and C’, S; and Sj are related to 
&’ and ¢’ (angles defining the direction of the incoming wave) by 
equations similar to (4) and (6). 

This incoming wave will cause a signal in the feeder travelling 
from the aerial towards a receiver. From the extension of the 
reciprocity theorem referred to above it is found that the complex 
quantity representing the amplitude and phase of this wave at 
the reference point P is given by 


A =}EY,(1 — S57)F\(S}, S5) 


and is therefore related to the angular spectrum F. 

It may be noted that this equation leads to the well-known 
result relating power gain, G(@, $), to the effective receiving area, 
A(C,49, f), namely 


Eqn. (14) also gives the phase of the wave received by the aerial. 


(14) 


(4) TRANSMISSION BETWEEN TWO AERIALS 


(4.1) Exact Result for Received Signal 


The problem of calculating the power transmitted from one 
aerial to a second is usually dealt with by assuming the distance 
setween the aerials to be sufficiently great for the far-field approx- 
imation to apply. This is perfectly satisfactory when con- 
sidering, for example, two stations of a relay network or the 
derivation of the radar equation, when the target effectively 
sehaves both as a receiver and a transmitter. In measuring the 
radiation pattern and power gain of an aerial, however, the site 
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is often so small that the far-field solution is not sufficiently 
accurate, and aerial designers must determine what errors can 
arise in the measurements and how to reduce them. Most of 
the calculations related to this have been based on extending the 
physical-optics solution to the Fresnel diffraction region, but in 
this method it is not easy to make appropriate allowance for 
the aperture sizes of both aerials. The plane-wave-spectrum 
approach makes it possible to derive an integral from which the 
power transmitted between aerials of any size can be calculated, 
and approximate methods of evaluating this integral can be used 
to estimate the effect of operating at too small a separation 
between the aerials. The integral will be derived for the situation 
of most interest, namely that of measuring the radiation pattern 
in one of the principal planes of the aerial. 

From the reciprocity theorem it is obvious that the aerial under 
test can be used either as a transmitter or a receiver. In the 
remainder of the paper we shall assume that the test aerial is used 
as the transmitter and that the arrangement is as shown in Fig. 3. 


RECEIVING 
AERIAL 


TRANSMITTING 
AERIAL 


Fig. 3.—Arrangement of aerials for the measurement of the radiation 
pattern of aerial T, used as a transmitter. 


TO is the direction of maximum radiation from T. 
R is directed so that the maximum of its radiation pattern is in the direction RT. 


The test aerial, T, and fixed receiving aerial, R, are arranged so 
that their directions of maximum radiation lie in the plane of the 
Figure: in addition, R is so oriented that the maximum of its 
radiation pattern always points towards T. The radiation pattern 
is then measured by rotating T about an axis perpendicular to 
the plane of the Figure and observing the output from a receiver 
connected to R as a function of the angle, 6), between the direc- 
tion TR and the direction of maximum radiation of T. If the 
aerial separation, r, is sufficiently large, the observed radiation 
pattern will be the true far-field pattern, which is related to the 
plane-wave spectrum as discussed in Section 2. 

The principle of the calculation is quite straightforward: the 
field radiated by T is expressed as an angular spectrum of plane 
waves. Each wave will cause a contribution to that in the wave- 
guide connected to the aerial R, and the total received signal is 
obtained by adding the contributions, allowing for any phase 
differences, i.e. fields and not powers must be added. The 
generalized reciprocity theorem of Section 3.1 provides the 
necessary information on the phase differences between the 
various contributions. Suppose, for example, that the plane of 
Fig. 3 is taken as the plane x = 0 and that unit power is radiated 
by T. Then, the radiated field is given by eqns. (7) and (8), 
with F replaced by F7. The component plane waves must now 
be referred to the co-ordinate system x, y’, z’, with respect to 
which the properties of aerial R are known. Consider one 
plane wave for which 


E = E(S;, S>) (Ci — S,k) exp [—jk(S,x + Spy + Cz)] (16) 
The co-ordinates y and z are related to y’ and z’ by 


y = (% = 2)59 — YC 
zZ=(r —7Z')eg + YS 


(17) 
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while the unit vectors j, A can be expressed as 


j= —J'Co — k's 
k =j’sy — k'co (18) 
j', k’ being unit vectors in the directions of the y’- and Z’-axes 
respectively. 
Egn. (29) can now be written as 
E = E(Sj, So(Ci — S,Sof’ + Seok’) exp [—jk(rC’ — x5; 
— y'S, — ZC) (19) 
where 
Si = — Si, S> = — Cso > S5Co and (GE = CCo + RYN) (20) 


The field is expressed in a form to which the results of Section 3.1 
can be applied: it will be observed that there is a y’-component 
of electric field, whereas the aerial accepts only waves for which 
this component is zero. We can overcome this difficulty by 
resolving E into two perpendicularly polarized components and 
rejecting the one which is not accepted by the aerial. For the 
wave given by eqn. (19) the amplitude of the component accepted 
by the aerial is 


E(S;, S2)(C/C’) exp (—jkrC’) (21) 


so that from eqn. (23) the amplitude of the wave in the guide 
connected to R is 


B=tY,(C/C)E(S;, S.)(1 — S{2)FR(S;, $5) exp (—jkrC’) . (22) 


where F*(S{, $3) is the plane wave spectrum associated with 
aerial R. The total signal received by aerial R is now given 
by integrating with respect to S,; and S,. When the incident 
field is generated by the transmitting aerial T, inspection of 
eqn. (7) shows that 


E(S;, S>) = FU(S;, S2)[CH (23) 


Carrying out the integration over the possible range of values of 
S, and $5, we find that the amplitude received by the waveguide 
connected to R when illuminated by T as in Fig. 3 is 


Yorleaiite (| cee: ee Sid 
Dae in J = SPA)FT(S, So)FI(S}, Sy) HE Se 


(24) 


The power received by R is given by |D|?._ Eqn. (24) is exact, 
but the evaluation of the integral in all except simple cases is 
extremely difficult. We will now consider certain simplifications. 


(4.2) Far-Field Approximation 


As already mentioned, the power received by R can be cal- 
culated fairly easily when the distance r between the aerials is 
sufficiently large. The integral can then be integrated by 
stationary phase methods, as shown in Section 7.1, the result 
being 

_ IY ¢0 


py oP (—jkr) FT, so)F K(O,.0). 


The power received by R is given by | D|?, so that 


(25) 
p ; Yoto\ y 7 
ower received = GS) | F700, So)|7|FR(0, 0)|2 . (26) 


If the wave spectra are now replaced by the power gain for the 
transmitter T and the receiving area for R, eqn. (26) becomes 


Power received = G™ (6 5) AR (0, 5) [4ar? (27) 
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which is the result obtained by a simple application of the ideas 
of power gain and effective receiving area. Our general expres- 
sion, eqn. (24), therefore reduces to the appropriate result when 
the distance r is sufficiently great. 


(4.3) Measurement of Radiation Pattern 


The radiation pattern as measured for a given distance r and 
known receiving aerial R is given by |D|*, as calculated from 
eqn. (24), regarded as a function of 4. For sufficiently large r 
the measured pattern agrees with the pattern at infinite distance, 
|F7(4, 7/2)|?, as shown in the last Section. To determine the 
errors in any particular arrangements we must evaluate D(6) toa 
higher degree of accuracy. This will be done for one case of 
particular interest, namely that in which the aerial to be measured 
has an aperture with large horizontal and relatively small vertical 
dimensions. The horizontal radiation pattern then has a smaller 
beam-width than the vertical, and it is usually considered essential 
to measure the horizontal radiation pattern at a distance greater 
than a?/X, a being the horizontal aperture dimension. 

Suppose that Fig. 3 represents a plan view of the arrangement 
used to measure the horizontal radiation pattern of T, and further 
assume that the fixed aerial R has a narrow horizontal radiation 
pattern but a relatively broad vertical pattern. The accuracy 
with which an integral can be evaluated by the stationary-phase 
method depends on the rapidity with which the integrand, 
excluding the exponential term, varies with the integration 
variable. Under the conditions postulated, the integrand, 
excluding the exponential term, varies relatively slowly with S, 
(corresponding to the wide vertical patterns of T and R) but 
rapidly with S,. There will thus be a range of values 
of r, greater than some minimum value, for which the 
integration with respect to S, can be carried out exactly as in 
Section 7.2. If we now replace S$ by sin, we have, from 
eqn. (49), 


4 (in/4 , { 
eae J, F{(@, sin f)Ff[0, sin (09 — )] 


cos (9 — xh) cos? % exp [—jkr cos (Oy — #)]dyb . (28) 


where I is the integration contour of Fig. 4(a), corresponding 
to sin % ranging from —oo to 00. I" is selected to ensure that 


D(4) = 


iy 


(2) (b) 


Fig. 4.—Integration contours for eqns. (41) and (43). 


I cos (A — ys) is negative to give the evanescent field decaying 
in the positive z-direction. This integral can be evaluated as an 
asymptotic series in terms of increasing powers of (1/kr) by the 
stationary-phase method. For the present purpose it is con- 
venient at this stage to specify the radiation pattern of the fixed 
aerial in more detail. The basis of our integration method is 
that only the plane waves travelling nearly in the direction from 
T to R make any significant contribution to the received signal: 
we therefore need consider only the form of the radiation pattern 


a ee 
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for angles close to the normal, i.e. for values of ws near O. A 
good approximation to the shape of most aerial patterns in the 
vicinity of the direction of maximum radiation is given by a 
Gaussian function, so that we can use the form 


Goa —2L0) — 3)? 
=) exp | B ue ik (29) 


where Gr is the power gain of R in the direction of maximum 
radiation and Bp is the half-power beam-width of the horizontal 
radiation pattern of R. Substitution for F¥® in eqn. (28) and 
changing the integration variable to = (0) — w) gives 


E¥'[0, sin y — )] = 2( 


JYoGr 
87Arco 


1/2 
D(8) = ( ) J Filo. sin (09 — €)] cos € cos3/? (89 — § 


Sa che 2 

oa ( use 2LE ) dé (30) 
R 

I” being the contour in Fig. 4(b). The convergence of this 

integral on I” is ensured by the presence of the term cos € in the 

exponential. 

Since we are considering narrow-beam aerials, cos J) can be 
replaced by unity and corresponding approximations made in 
the integrand. This enables an asymptotic series to be obtained 
for the integral, as shown in Section 7.3, and from the results 
- there we have 


iYoGp\ "2 
D(8o) = 4 exp (—jkr + Li WD 


x [FT(6,) + CP YD rro@,) +... 


2|p| 
BAGS. «A, (Oni) elutns : 
@n|Ip/" exp(jnn) FTCA) + ...] (31) 
where |p| = [(Ar)? + (4L/Bp))?]' (32) 
and 7 = arc tan (krB?/4L) (33) 


In eqn. (31), the spectrum F7 is shown as depending on the single 
angle 9, since we are considering only the principal plane, and 
F1(Q)) is the nth derivative of the function F{(@) with respect 
to the angle 0, evaluated for 0 = 6. 

The expression in eqn. (31) has similarities to the correspond- 
ing result obtained by using the Kirchhoff-Huyghens method;3 
in this, no allowance is made for the beam width of the fixed 
aerial, and it can easily be shown that the Kirchhoff-Huyghens 
result is identical to that of eqn. (31) if Br > ©, ice. if the fixed 
aerial accepts incoming signals equally strongly from all direc- 
tions. The more general result allows us to examine the effect 
of the size of the fixed aerial on the accuracy with which the 
radiation pattern of the transmitting aerial can be measured. At 
first sight it would appear that using a directive receiving aerial 
should cause the measured pattern D(9@)) to approximate more 
closely to the desired form F 78), since the magnitudes of the 
terms after the first on the right-hand side of eqn. (31) decrease 
as Br is decreased. In the limiting case of an infinitely large 
receiving aerial, Bg =0, |p| > © and D(6o) « F4(%). This 
shows that, in principle, the pattern of T can be measured to any 
desired degree of accuracy at any distance, provided that a 
sufficiently directive receiving aerial is used. Woonton and 
©arruthers+ have demonstrated this experimentally for small 
horns, although practical difficulties arise because of reradiation 
by the aerials. A problem of practical importance is how large 
tne fixed aerial must be to give a significant degree of improve- 
yaent over a non-directional aerial. The answer to this is com- 
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plicated by the dependence of the phase of the various terms in 
eqn. (31) on Br. We therefore resort to numerical analysis of a 
particular case to determine the nature of the dependence of the 
measured pattern on Bp. 

The example which has been considered is that of a transmitting 
aerial which has a constant aperture distribution. Once again 
we restrict the analysis to large aerials, so that only small values 
of need be considered. Then, the radiation pattern of T is given 
sufficiently accurately by the expression 


Ga Nil sin (zra0/X) 


T = 
ie NG (za |X) a) 


7 Yo 


a being the length of the aperture of T, and the factor 
NGr/27Y)'!2 serving to give absolute values of field strength 
when T radiates unit power. 

When this expression for F7(@) is used, eqn. (31) can be 
evaluated numerically with the help of the tables prepared by 
Milne.? This has been done for a separation equal to the Rayleigh 
distance, a?/A, using several values of the parameter 7 correspond- 
ing to different values of Bg. The maximum number of terms 


' used in the series of eqn. (31) in carrying out the calculations is 


five, ie. derivatives up to F7 are included. The results are 
shown in Fig. 5, being normalized in each case so that the 
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Fig. 5.—Radiation patterns for a uniformly illuminated aerial 
as observed at the Rayleigh distance, a?/A. 
(a) n = 90°; ee receiver. (Pattern as calculated from the diffraction 
theory. 
(b) n = 75°; Br/Br = 1-46. 
(c) 1 = 60°; Br/Br = 0:99. 
(d) 1 = 45°; Br/By = 0°76. 
(e) 1 = 30°; Br/Br = 0°58. 
(f) n = 0°; Br/B- (actual pattern as observed at infinite distance). 


reference level corresponds to the maximum on-axis signal. The 


most interesting features of the set of patterns shown are: 


(a) The measured half-power beam width is only slightly affected 
by the beam width of the fixed aerial. 

(b) The first minimum becomes more marked as the fixed-aerial 
beam width decreases. 

(c) The first side-lobe level at first decreases as the fixed-aerial 
beam width decreases, reaches a minimum and then increases to 
the far-field value when Br becomes zero. 
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The last point is probably the most interesting from a practical 
point of view, and it will be noted that the side-lobe level appears 
too large if a small fixed aerial is used, but is too small if a very 
directive fixed aerial is used. The assumption frequently used in 
practice that side-lobe levels are apparently worse in the pattern 
measured at too short a range than they are in the true pattern 
may not therefore always be correct. The value of 7 for which 
the measured side-lobe level agrees with that in the true 
pattern is about 75°, corresponding to B,r/Br 1°45 (with 
By = 0-88A/a rad). For this condition the length of the fixed 
aerial is roughly two-thirds that of the aerial under test. 

It should be stressed that the above numerical results apply 
only to one particular arrangement, namely the two aerials being 
separated by the Rayleigh range and each having a specially 
selected radiation pattern. It is probable that the results are 
not critically dependent on the shape of the fixed-aerial pattern, 
but on the other hand the behaviour of, for example, the measured 
side-lobe level may well vary with the nature of the pattern under 
test. One general conclusion can, however, be drawn: the 
problem of measuring very large aerials is not greatly eased by 
using a large fixed aerial. In the case being considered it is 
unlikely that a fixed aerial larger than the test aerial can be used, 
giving a minimum value for 7 of about 60°: while the pattern 
for 7 = 60° in Fig. 5 approximates more closely to the actual 
pattern of the test aerial than does that for an omnidirectional 
fixed aerial, it still shows significant errors. 


(4.4) Measurement of Power Gain 


Most methods of power-gain measurement involve measuring 
the attenuation caused by transmission between two aerials and 
then using eqn. (27); either the gain of one aerial is known or 
two identical aerials are used, so that the unknown gain can be 
calculated. The preceding theory enables us to estimate the 
errors arising in any specified arrangement: one example is pro- 
vided by the results in the previous Section if the received power 
is calculated from |D(@))|?.__ The retention of the various con- 
stant terms in eqns. (29)-(34) permits the immediate calculation 
of the power received by R when unit power is transmitted from 
T; the ‘measured gain’ can then be calculated from eqn. (27). 
Results applicable under the conditions of the previous Section, 
i.e. both aerials having horizontal patterns much narrower than 
their vertical patterns and being separated by the Rayleigh range, 
are given in Fig. 6, the on-axis gain of T being plotted against n. 
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Fig. 6.—Dependence of measured gain of transmitting aerial on 
receiving-aerial beam width. 


The ratio of the measured gain to the true gain of a transmitti i i 
) 1 : 1 smitting aerial which has 
nee y porzontal ee and avace vertical beam is plotted against y which is Aaa 
in terms of beam-widths in the caption to Fig. 5. The measu 
Shorey teria rements are assumed to 


It is immediately obvious that the measured gain depends very 
markedly on the size of the fixed aerial R: the least error between 
the measured gain and the true gain Gy occurs for an omni- 
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directional aerial R, the difference of 0-25 dB agreeing with cal- 
culations made by Polk using Fresnel diffraction theory. The 
error increases very rapidly with decreasing values of Br and 
becomes 1-69dB at 7 = 60°, which corresponds quite closely to 
the situation when two identical aerials are used. For a very 
large receiving aerial (7 = 0), the received signal tends to zero, 
as does the measured value of the gain of the transmitter aerial. 

A more general result for two identical aerials can be obtained 
directly from eqn. (24); if the radiation pattern maximum is 
directed along the aerial axis, the gain is usually measured only 
for the two aerials pointing directly at each other, i.e. for 05 =0! 
Then 


Si = — 5,33, = S22 = (35) 
giving 
Vatiieck Gee ISR 
= 55 i 3 [ (C+SDAG, S2)F{(—S1, S_) 2° ——* 30) 


where the indices J and R can now be omitted, since the two 
aerials are identical. We further restrict the discussion to aerials 
with constant phase symmetrical aperture distributions, i.e. for 
which 


FAX, Ys 0) a WO a ames 0) = EXG; y, 0) (37) 


For such aerials F(S,, S,) is a real symmetric function of S; and 
S>, so that the integrand in eqn. (36) can be expressed in terms of 
the power gain, giving 
1 ioe) OO . 
D=,-| | G(S,,S;) exp (—jkrO)dS;dSyIC . G8) 
47 J_ 4 @ 

The integral can again be integrated by the stationary-phase 
method, the dominant term corresponding to the result in 
eqn. (27). We shall consider in detail only the case of an aerial 
whose pattern is symmetrical about its axis, e.g. a paraboloid 
with a circularly symmetrical aperture distribution. The function 
G is then an even function of @ and can therefore be regarded as 
a function of C. The evaluation of the integral when this is so 
is considered in Section 7.4, and the result is applied to a uni- 
formly Uluminated circular aperture. Numerical results for the 
difference between the measured gain and the actual gain are 
shown in Fig. 7 as a function of the separation of the aerials. 
It will be observed that the measured gain is noticeably less than 
the actual gain when the separation is less than d?/A, d being the 
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Fig. 7.—Dependence of measured gain on aerial separation. 


The curve is calculated assuming that the measurement is made using two identical 


aerials of circular apertures, each having the diame . istributi 
oni eee 5 g ter d; the aperture distributions are 
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aerial diameter, and that the difference is 0-86 dB at the separa- 
tion d?/X. This figure is considerably larger than the 0:27dB 
estimated by Silver* by a diffraction calculation, this correspond- 
ing to the assumption of a small receiving aerial. 


(5) DISCUSSION OF RESULTS 

It is apparent from the results given here that the errors in 
measuring aerial radiation patterns and power gains may be 
appreciably different from those predicted by optical diffraction 
theory. The reason for this difference is that diffraction theory 
is used to calculate the distribution of electric and magnetic 
field strengths and that the essential step of deriving the signal 
received by an aerial of finite aperture is not considered. One 
exception to this is Krutter’s discussion} of the errors in gain 
measurement, where the effect of a taper in the amplitude dis- 
tribution of the aperture of the receiving aerial is mentioned as a 
possible source of error. Krutter’s suggested criterion of con- 
stancy of amplitude to within 10% is inadequate to ensure 
accurate gains, at least in the situation discussed in the previous 
Section. The errors arising from the amplitude variation can 
exceed those due to phase errors, and caution must therefore be 
used in applying correction factors of the type discussed by 
Polk. Similarly, examination of the radiation patterns shows 
that measurements on short sites can have errors of the opposite 
sign to those given by diffraction theory if even a moderately 
large receiving aerial is used. The procedure described by Bates 
and Elliott® to correct radiation-pattern measurements can be 
used only when small receiving aerials are used. 

The plane-wave-spectrum concept introduced by Booker and 
Clemmow is a very powerful tool in the analysis of this type of 
problem. The analogy to a frequency spectrum it raises provides 
a convenient qualitative picture of the reasons for the influence 
of receiving-aerial size on the measured results. The plane- 
wave spectrum, which for aerials is the far-field radiation pattern, 
is related to the transmitter-aerial aperture distribution in the 
same way as is the frequency spectrum of a wave to its time 
variation. The measurement of radiation pattern is therefore 
similar to that of a frequency spectrum and is basically a filtering 
problem. The filtering is partly effected by the destructive inter- 
ference of the waves travelling between the two aerials, which 
corresponds to the effect of a dispersive transmission line on a 
waveform. The receiving aerial, however, has also a filtering 
action because of its discrimination between plane waves arising 
from different directions. It is this additional filtering associated 
with the receiver which makes it essential to go a step beyond the 
normal diffraction treatment. 

The numerical examples given in the paper have been based on 
the uniformly illuminated aperture to facilitate computation. 
For aerials with distributions of the type realized in practice, 
similar results will apply but may be less severe. There is a need 
for further numerical investigation, but to obtain a sufficiently 
wide range of typical results would require a considerable 
computation programme. 
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(7) APPENDICES 
(7.1) Far-Field Approximation to Received Power 
The integral to be evaluated [eqn. (24)] is 


Y, es fed 7 , ‘ . 
D=5y5| | Gd — SFIS, SIFRCS;, Si) exp (—jkrC) 
dSdS,|C’ . (39) 


When kr s 1 the exponential factor is a rapidly changing function 
of S, and S, and the principal contribution to D arises from the 


stationary-phase region. Since 
C’ = og(l — S} — SP'? + s9Sp (40) 
the stationary point is given by 
S; = 0; So = 59 (41) 
Near this point, 
ee ND 
Co = 4st {52 = 50)" (42) 
% 
so that, by using a result quoted by Silver, * 
Yoc : 
D ~ SF iO, So) FR(0, 0) exp (—jkr) 
[ | exp Limes? + UJas\au (43) 
where U = (S, — So)/eo (44) 
: Yoco _,_~ 2m exp (j7/2) 
1.e. D= a2 FiO, Sp) FRO, 0) exp ( —jkr) ee Fa 
(45) 
De 
=/XOFTO, 50) FR, 0) exp (—jkr) (46) 


(7.2) Evaluation of Integration with Respect to S; by 
Stationary Phase 

Under the conditions described in Section 4.3 the integration 

with respect to S; can be approximated by the first term of the 

asymptotic expansion given by the stationary-phase method. 

The stationary point is at S; = 0, so that from eqn. (24) we have 
Yy, co 

p=m4]_.] 

DN I Dei 

exp {—jkr[e9 (1 — S} — $3)" + soSp]}dSidSy 


* Reference 2, p. 121. 


eo 


(I = $2)'P FTO, S2)F IO, S2) 
(47) 
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in which $, and S; are made zero in all terms except the 
exponential. 
The stationary phase method gives 


: tied meee 
| exp [—ikreg( — S} — S,)'?]ds, = (=) Pa = sai 
exp [—jkrey (1 — Sen + jr4] . (48) 
so that 
Yo exp (jz/4) 6° 
sane lee 


D 


S12 — S3)'4FTO, SFO, S3) 
exp {—jkr[eo(1 — S3)!/? + soSz]}dS> (49) 
Since 5S; = S; = 0, @— S17 = C’, Le. Ceg + S$50- 
(7.3) Evaluation of Integral in Eqn. (30) 
The integral which occurs in eqn. (30) is 


— | FT[O0, sin (@9 — )] cos € cos? (8) — €) 
be 
2 
exp E: cos € — 2 lee = EO) 
R 


where the integration contour I” is shown in Fig. 4(5). We 
are concerned with the value of J for aerials which have a narrow 
horizontal pattern, so that the only values of 45 and € of interest 
are those near to zero. The trigonomietrical functions cos € and 
cos (0) — €) can therefore be approximated to unity, except in 
the exponential, where cos & should be replaced by (1 — 4&7). 
Then 


1= J FTI0, sin Gp — SJ exp (—jkr — 4p8)dE . (61) 


where p= z — jkr (52) 
We may write p = |p| exp (—jn) (53) 
if n = arc tan (krB3/4L) (54) 
Let € = € exp (—4jn) (55) 


Then, J = exp (—jkr + 4jn) J F{(@y — 6) exp (—alpleaat (56) 


where F{ is now shown for convenience as a function of (8) — &). 
This integration is carried out along the contour for ¢ corre- 
sponding to € moving along I”, but since F7 is a regular function, 
the contour can be distorted into the real axis without altering 
the value of the integral. The integrand vanishes as |g | => CO, 
so that no contribution results from the infinite arcs joining the 
ends of the original and modified contours. The integral can 
now be evaluated using Watson’s lemma,’ with the result 


r= G a exp (—jkr ewe emo + a 
ee hes abe uae Og 
SS =e ee } . (67) 
From eqn. (55), we see that § = 0 when ¢ = 0, and, furthermore, 
(Hanes = E Ginn) ori 


ny, 
= exp csmyc—y| 4 # Ol 


6" (58) 
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ee) 


d"F Ag) 
Tet 0=00 


FTO) = | OTN 
Then 


= wre M2 exp (—jkr + sin| FTG) + Te FIO) +. 


. (2n — 1) exp(jny) FTCn) Oyen, " . (60) 
+. (Qn)! |p|" a 


(7.4) Evaluation of Integral in Eqn. (38) 


The integral to be evaluated is 


D=~[ | GS), 5) exp (—skrC)dSydS/C 
4 /— 4-0 


=-[ | es, 5) exp (= jkr CdS dS (61) 
TT ~0 0 


since G(S,, S,) is assumed to be an even function of S; and S). 
Changing the integration variables to S; and C gives 


= oi ds, | dCG(C) exp (—jkrO ld — C? ~ $7). (62) 
TT ~O Ty 
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Fig. 8.—Integration contours for integrals in Section 7.4. 


where [',, the integration contour for C, is shown in Fig. 8(a), 
and it is assumed that G is a function of Conly. Changing the 
order of integration in eqn. (62) gives 


p-3f 


I’, being the contour shown in Fig. 8(d). 


Vv (—C2) 
ac}. dS,G(C) exp (jkr) — C2 — S22 (63) 


In egn. (63), the integration with respect to S; can be evaluated — 


immediately, giving 


jee IL G(C) exp (—jkrO)dC (64) 


Let C=1-jy (65) 
and distort the integration contour for y into the real axis: this 
is permissible, since the integrand is a regular function, which 
will vanish, as |c| > 00 for all cases of interest. 
Then 
oe) 

D = sjexp (—jkr) | GU — jy) exp (— kry)dy (66) 
The integral is in the standard form for integration by Watson’s 
lemma, giving 
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: ; Gil Let n+r=t. (75) 
D = 4jexp (ik) Doral! 1) mn 
kr — (kr)*\dy/ y—0 Then, G(C) = 
1 a"G ures — ! 2(t—r) 
SS yaremernmy hoa 25 (Ca DE in Ox (2)—*1!(2t + 2 — 2r)'(md]JA) 
aaa dy? FeO aR be | (67) Cr ( zZ 2 


But 


i i a es 


We wish to evaluate the integral for the particular case of a 
uniformly illuminated circular aperture of diameter d, for which 


(68) 


GO) = 4G752(u)/u? (69) 
Gr being the gain in the forward direction, @ = 0, and 
u = (md)D) sin 0 (70) 


We can obtain the derivatives needed in eqn. (67) if G(8) can 
be expressed as a sum of powers of (1 — C). 


Now, 
HO) 1S ("An + Du]2 (71) 
we 4, Sonlin + Dn + D!P 
and = u* = (md/A)*(1 — C?) = (ndfA)? [2 — C) — (1 — ©] 
(72) 
Hence 
nh Eo Gg ! gy ve): i — r 
yen = (ard]A)"(1 z: ony a a C) (73) 
Ku) 1 2 Bn t+ Daad[r*" [RC — Yr" 
ie =; mH Ho ria — Ain + D!N@ + DIP ee 


=r 2) rt = Wit +2—N'G@+1—n!/ 


(76) 


from which it follows at once by inspection that 


dG mdr! 35 (Ot +2 — 2n)nd[d)-?" 
dc _ 2 Get Fe) y ECR 
(77) 


For large aerials, d s A, so that only the first term in the series 
need be considered, giving 


d'G (2t + 2)!Gr mes (78) 
a car E+ DYE HD!P L2H 
Collecting results from eqns. (67) and (78) we have finally 
__ jGy exp (—jkr) = (2t + 2)! —jrd?\t (79) 
an 2kr ‘o(t+2)'[¢+D)!P\ 4rd ) 
and, on using eqn. (27), 
Gin\2 2 (2t + 2)! —jnd?\*|2 (80) 
ee > (¢+2)!'[¢@+ D!IP\ 4rA ) 


where G,, is the apparent gain if measurements are made at 
separation d. 
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ELECTROMAGNETIC ENERGY TRANSFER 
By P. HAMMOND, M.A., Associate Member. 


(The paper was first received 7th August, and in revised form 1st November, 1957. It was published as an INSTITUTION MONOGRAPH 
in February, 1958.) 


SUMMARY 
Methods of calculating and measuring the flow of electromagnetic 
energy are compared and contrasted. The differences between the 
low-frequency and high-frequency approaches to energy flow problems 
are discussed and suggestions are made to ease the difficulties in the 
way of students and teachers faced with these apparently irreconcilable 
differences. 


(1) INTRODUCTION 


In teaching the principles of electromagnetism very much time 
is taken up with the discussion of cases of energy transfer. The 
aim of such discussion is to relate the primary concepts of charge 
and field strength to the process or mechanism of energy inter- 
change. The term ‘mechanism’ is used here in accordance with 
Maxwell’s aim of reducing electromagnetism to the study of the 
mechanics of electricity or of the aether.! This aim has by no 
means been completely achieved, as is borne out by two recent 
papers on the subject.4»5 

In focusing attention on this mechanism we seek first for a 
clear and consistent mental picture or concept. Neither mathe- 
matical theory nor experimental technique is primary in this view 
of the matter, although the value of the mental concepts will be 
judged by their fruitfulness in calculation and experiment. In 
the words of J. J. Thomson :6 

Students have a great tendency to regard the whole of Maxwell’s 
theory as a matter of the solution of certain differential equations 
and to dispense with any attempt to form for themselves a mental 
picture of the physical processes which accompany the phenomena 
they are investigating. I think that this state of things is to be 
regretted, since it retards the progress of the science of Electricity 
and diminishes the value of the mental training afforded by the study 
of that science. 

The same is true, a fortiori, of attempts to shorten the labour 
of thought by conducting extensive unplanned experiments. 

The process of acquiring a consistent set of concepts to deal 
with problems of electromagnetic energy interchange is difficult. 
The difficulties are increased by the accident of the historical 
development of the subject. Whereas some writers picture 
electric particles in an otherwise empty space, others see the whole 
of space as a seething mass of electric or magnetic whirlpools. 
It is no wonder that disaster sometimes overtakes the student 
who endeavours to reconcile these mental pictures by inserting the 
particles in the whirlpools. Nevertheless, the existence of such 
conflicting pictures, which we have inherited from the great 
electricians of the past, is a constant stimulus to clear thought. 
The paper is an attempt to clarify some of the apparent 
contradictions. 


(2) SOME SIMPLE EXAMPLES INVOLVING ENERGY | 
TRANSFER 
In this Section examples will be discussed involving direct 
currents or low-frequency alternating currents in order to analyse 


the concepts used in what is generally called heavy-current 
engineering. 


Correspondence on Monographs is inyited for consideration with a view to 
publication. 


Mr. Hammond is in the Department of Engineering, University of Cambridge. 


(2.1) Battery Circulating Current through a Resistance Wire 


Perhaps the simplest form of energy interchange is the heating 
effect of an electric current flowing through a wire which con- 
nects the terminals of a battery of constant voltage. If this 
voltage is V and the current is J, the battery supplies energy to 
the wire at the rate of V/ per unit time. This energy is expended 


in overcoming the resistance of the wire. Ohm’s law states that — 


V is proportional to J, ic. V = RI, where R is the resistance. 
This can be written as E = pJ, where E is the electric force, p 
the resistivity and J the current density. In this form the law 
can be applied to alternating currents, whereas V loses its useful- 
ness in the description of high-frequency effects. 

Ohm’s law is so simple in form that it is taken by many as 
almost self-evident. There is, however, a difficulty connected 
with it that will serve as a good introduction to the subjects 
discussed here. ‘This difficulty may be expressed in the question: 
what is the origin of the electric force E on the current in the 
wire at a place considerably distant from the terminals of the 
battery? Clearly E can arise only from electric charges. But 
those at the battery terminals can act appreciably only on their 
immediate neighbours, since the force decreases as the square of 
the distance. Consider, for instance, what happens when a 
resistance wire is connected to the plates of a battery. Before 
the connection is made there is a considerable electric charge on 
the plates, and hence a large electric force in the vicinity of the 
plates. After the connection has been made and transient 
currents have disappeared, there is a uniform electric force 
E = pJ everywhere along the wire. Thus the electric force 
close to the plates has been reduced, while that at distant points 
has been increased. How has this come about? This difficulty 
was understood by Ohm,’ who felt that only immediately adjacent 
particles of electricity could act on one another to produce the 


current flow. Ohm very nearly achieved a solution of this - 


problem by his consideration of the surface charge that would 
also be present. He went so far as to identify the galvanic 
electricity of his circuit with Coulomb’s static electricity. But 
he failed to see that this surface charge, combined perhaps with 
a charge distributed within the volume of the wire, provided the 
source of the necessary electric force. Instead, he took refuge 
in the analogy of the flow of heat, in which the problem does not 
arise.* 

The analogy in general use now is that of fluid flow in a pipe. 
Neither of these analogies is strictly accurate, because each fails 
to take into account that there are in the wire positive charges 
as well as negative ones. Consideration shows that a current 
flow in a conductor implies also a surface and possibly a volume 
distribution of electric charge. It is this distribution which 
produces the uniform electric force along the wire, irrespective 
of the path of the wire. Far from being obvious, it is surely 
amazing that the current should not be altered however much the 
wire is twisted and contorted (we are, of course, assuming 


* There is no intention here of detracting from Ohm’s great achievement in clarifying 
a subject beset with difficulties. It must be remembered that the various quantities 
used at the time, such as electric strength and intensity, had not been clearly defined. 
There were also great experimental difficulties, owing to the absence of measuring 
instruments and the uncertain behaviour of batteries subject to polarization. The 
immense development of Ohm’s thought is well seen if his paper of 18259 is compared 
with the great work of 1827.8 
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constant resistivity in this discussion). This problem is very 
similar to the magnetic one of providing a path for a magnetic 
flux by means of an iron core, where the magnetic force has to be 
provided by surface and volume distributions of pole strength.!0 
It is well known that the leakage flux resulting from the surface 
distribution will depend very considerably on the configuration 
of the iron core. In principle, therefore, there must also be, in 
the electric case, some difference in the surface distribution of 
charge, and hence in the current flow, if the path between the 
battery terminals taken by the wire is altered. For magnetic 
forces this alteration in the flux is measurable, but in the case of 
current flow no such change has been observed. This is not 
surprising, since a coulomb of charge will give an electric force 
of the order of 10’? volts/cm at a distance of lcm. Thus, only a 
minute amount of charge is required to cause the current to 
flow. Nevertheless, a total absence of electric charge would 
also mean the cessation of current flow. The necessity for this 
charge distribution is hardly ever mentioned in the. textbooks. 
In general, students have to be satisfied with references to water 
pipes. Pidduck,!! however, mentions the existence of these 
charges. The explicit realization that they must be present will 
be found very helpful in understanding the subsequent discussion 
in the paper. 


(2.2) Energy Interchange in a Simple Direct-Current Motor 
or Generator 


Cullwick!? has discussed in considerable detail the action of a 
motor or generator whose moving element is a straight conductor 
moving transversely to its length across a uniform magnetic field. 
To avoid the complication of a moving frame of reference, let us 
consider a simple d.c. machine in which the field poles move 
past stationary armature conductors. On open-circuit there is 

‘a charge separation in the armature conductors which exactly 
opposes the motional induced electric force Bv, where B is the 
magnetic flux density and v the velocity of the field system relative 
to the conductors; only so can the net electric force be reduced 
to zero. It should be noted that this charge separation will 
produce a distribution of charge throughout the length of the 
conductors and not only at the ends. The reason for this is 
the same as that advanced in Section 2.1. There must be zero 
electric force everywhere along the conductors, since no current 
flows on open-circuit. But charges at the ends of the conductors 
can act only at places close to those ends. They are not capable 
of producing zero electric force throughout the length of the 
conductors, and hence there is a need for the additional charge 
distribution along the conductors. It is helpful to notice 
explicitly that the electric force induced by the motion is equi- 
valent to that produced by a distribution of electric charge equal 
and opposite to the open-circuit distribution. 

Consider now the action of a motor with constant armature 
current. (As before we are considering the case of a stationary 
armature.) When the field system is at rest, a charge distribution 
will be required to produce everywhere an electric force pJ. 
This charge distribution will correspond to a terminal potential 
difference V, = IR. When the motor is running, this charge 
distribution will no longer be sufficient to cause the current to 
flow, because the induced electric force will be equivalent to an 
opposing charge distribution. In order to produce a resultant 
electric force pJ, an additional charge distribution is required. 

This is expressed by the well-known motor equation V; = V; 

_ + IR, where V, is the terminal voltage, V; the induced electro- 

motive force, and / and R are the armature current and resistance, 
respectively. The charge distribution has had to be increased 
by (V; + IR)/IR. ire 

- "All this is rather obvious and the terms ‘potential difference’ 

_and ‘induced e.m.f.’ are well understood. But it is rather sur- 
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prising to find that an exploring test charge would not be able to 
differentiate between the case when the motor is stationary and 
when it is running. In either event the observed electric force 
would be pJ. Neither the applied p.d. nor the induced e.m.f. can 
be observed separately. The equation V, = V; + IR is written in 
this form chiefly in order to present a clear mental picture and 
to separate the energy loss from the gross mechanical output. 
Since only the resultant force can be observed, we propose to 
call V, and V; the partial fields, whereas /R represents the total 
or resultant field. It may come as a shock to some engineers to 
realize the largely conceptual nature of such well-known quantities 
as V, and V;. 

A very similar argument holds for the case of a generator, 
where we have the equation V; = V,-+ JR. In this case the 
induced electric force must overcome the opposing charge dis- 
tribution V, and provide a resultant electric force equivalent to 
IR. Once again, only the resultant force would be observed by 
an exploring test charge. As long as the exploration is confined 
to the conductor system, it is not even possible to differentiate 
between motor and generator action. 


(2.3) Alternating-Current Circuits Involving Inductance and 
Capacitance as well as Resistance 


In investigations into the behaviour of time-varying currents, 
it was early discovered that the simple Ohm’s law relationship 
V = RI was not sufficient to describe the observed results. The 
two new concepts of inductance and capacitance had to be 
introduced. The concept of inductance arose from a con- 
sideration of Faraday’s laws of electromagnetic induction. It 
was clear that the applied voltage and the current in a circuit 
were, in the absence of capacitance, connected by a relationship 
such as v = Ldi/dt + Ri, where L is the inductance. In the 
terms of our previous discussion, it follows that with alternating 
currents an additional charge distribution is required to give rise 
to a p.d. of Ldi/dt. Once again, however, the actual electric 
force within the conductor is E = pJ and this is the force which 
would be observed by a test charge. Faraday!3 attributed the 
inductance effect to the action of the magnetic field surrounding 
the current. In other words, Ldi/dt, like the back-e.m.f., V;, of 
a motor, is an effect that could be isolated only if the conductor 
were removed. Since the current would then cease to flow, Ldi/dt 
cannot be thus isolated: it can, however, be taken to represent an 
electric field in free space. This is very seldom stated explicitly 
because, as in Ohm’s law, the charge distributions accompanying 
the effect are ignored. A similar discussion applies to the effects 
of capacitance, which result in an additional charge distribution, 
giving a p.d. of Cf idt. In this case the action is attributed to 
the electric field surrounding the conductor. 

For the motor it was found convenient to distinguish between 
V,and V;, in order to divide the input power V,J into mechanical 
power V,I and ohmic loss (V, — V;)J. A similar useful result 
is achieved by separating v and Ldi/dt. vi gives the input power 
and (v — Ldifdt)i is the ohmic loss; hence (Ldi/di)i can be 
attributed to the magnetic field. Maxwell? showed that this 
term corresponded to an energy of amount given by the volume 
integral {f{f4BHdv, where the integration is to be taken 
throughout all space. This result, however, applied strictly 
only to steady magnetic fields. A similar result was obtained 
for the capacitance term, and the corresponding energy was 
found? to be fff 4DEdv. 

If an alternator is connected to a circuit, its energy output is 
partly dissipated in heating the conductors. The rest of the 
energy can be deemed to be stored in the electric and magnetic 
fields surrounding the circuit. It was thought that the stored 
energy was fully recoverable by the alternator and its associated 
circuit. This, however, was questioned by Fitzgerald!* in 1882. 
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Fitzgerald based his reasoning on theoretical grounds and showed 
that, on the basis of Maxwell’s hypothesis of displacement 
current, some energy must be lost to the alternator and must be 
radiated into space. 

In discussing this effect it becomes essential to abandon the 
terms of voltage, inductance and capacitance, but the argument 
is not affected. Instead of the voltage equations we now have 
the Lorentz force equation 


Py) 
b= —-“ 4 ox B—gradd 


where A is the delayed vector potential, d is the delayed scalar 
potential, and »v is the velocity of the charge through the field B. 
This expression for E shows separately the inductance, motional 
and capacitance effects. Fitzgerald’s hypothesis meant that 
neither 0A/d¢ nor grad d was necessarily in time-quadrature 
with the current density J. We note that at a conductor 
—)A/dt +u x B—grad¢d = pJ. The effects are isolated for 
ease of computation and more especially to give a clear picture 
of the various items in an energy balance sheet. 


(2.4) Action of a Transformer 


Whereas the self-inductance effect at low frequencies was found 
to give no net energy transfer,!* it was apparent, from Faraday’s 
discovery of mutual induction, that energy could be transferred 
from one circuit to another. In addition to the various 
charge distributions already mentioned, there will now be 
another. Moreover, the instantaneous magnitude of this dis- 
tribution on the primary circuit is governed by the secondary 
current, and hence its phase depends on the secondary current. 
In terms of the Lorentz force E = — 0A/dt +u x B — grad ¢, 
dA/d¢ may now have an in-phase component with the current 
density, and this would provide a unidirectional or irreversible 
energy interchange. It is of interest to point out that in a.c. 
rotating machines the phase of the » x B term is also generally 
arranged to give some unidirectional flow. This is not invariably 
so, however, and examples to the contrary are synchronous 
condensers or machines like the Kapp vibrator. 


(2.5) Some Conclusions 


It has been shown that energy is transferred through the action 
of charge distributions on the conductor. The source of power 
experiences back-e.m.f.’s which may or may not be in phase 
with the voltage generated by the source. It is extremely con- 
venient to separate the effects into magnetic and electric field 
effects and into mechanical and chemical energy effects. Each 
one of these effects may be considered to act separately in 
impeding or aiding the flow of electric current from the source. 
However, it must be kept in mind that the actual current flow is 
the result of all the effects acting together, and for a particular 
current the resultant electric field must always be governed by 
the resistance of the conductor. Thus it follows that an investiga- 
tion into the electric field in the conductor cannot give any 
indication about the energy flow from the conductor, whether 
it be mechanical or electromagnetic energy. The concepts so 
far employed show why the flow into the conductor may be 
impeded, but once the flow has been established these concepts 
tell us nothing about the method of energy flow from the 
conductor. 


(3) ENERGY TRANSFER BY ELECTROMAGNETIC 
RADIATION 


(3.1) Maxwell’s View of Electromagnetic Action 


Maxwell’s proposition that light is an electromagnetic dis- 
turbance suggested to his contemporaries that new concepts 


would have to be found and new mathematical methods developed 
to account for the problems associated with energy flow. Two 
questions presented themselves. The first concerns the transfer 
of energy to distant bodies, whose action on the source of the 
energy can reasonably be assumed negligible. In the language 
of power appliances, this question concerns the possibility of 
power flow when the device is on open-circuit. The second and 
related question concerns the possibility of localizing the energy 
flow in space and thus using the language of optics and, in 
particular, of Huygen’s principle in electrical problems. Max- 
well’s preoccupation with the properties of the aether greatly 
reinforced the view that such localization of energy was to be 
expected on physical as well as mathematical principles. He 
failed to convince a number of electricians, such as Lord Kelvin, 
who had perhaps more of an engineer’s turn of mind and clung 
to the belief that the conducting matter in the circuits was 
actually both the seat and the vehicle of energy interchange. 
However, his success was complete with those who concerned 
themselves with the possibility of electromagnetic radiation. It 
is of interest to note that this difference of approach is to a 
great extent responsible for the somewhat irrational distinction 
between heavy-current and light-current engineering. 


(3.2) Poynting’s Theorem of Energy Flow 


A very famous attempt at a solution of the twin problems of 
energy flow in space was made in 1884 by J. H. Poynting, and 
has since become known as Poynting’s theorem.!7 

Poynting considered the rate of change of energy distribution 
within a closed volume. Taking Maxwell’s expressions,”>? he 
writes 


Gee. Ae 
curl H= J+ dD/dt 
curl E = — dB/odt 


Therefore E.dD/dt + AH. dB/dt 
=E.crlH—E.J—H.curlE 
= —E.J—div(E x A) 


Kes + Tao + [ [ [sem 
7 —|je x H). ds 


Thus the increase in stored energy, together with the ohmic loss, 
equals the in-flow of a vector E x H across the surface bounding 
the volume considered. (Poynting also considered terms due to 
the motion of the circuits. These have been omitted here for 
the sake of simplicity.) The Poynting vector E x H could thus 
be interpreted as the vector indicating the density of energy 
flow at any part of the surface, or indeed anywhere in space. 

The possibility of energy flow on open-circuit, which would 
be akin to the method by which solar energy reaches the earth, 
was thus shown by Poynting to depend merely on the magnitude 
of £ and A. It could readily be shown that on Maxwell’s 
hypothesis E and H would have components varying inversely 
as the distance. Thus f f (E x H).ds would be finite at even 
the largest distances, and Fitzgerald’s suggestion!+ could be 
demonstrated mathematically. Poynting himself applied his 
theorem to some simple cases of current flow including d.c. 
problems. In every case the same answer could be obtained by 
his method as by the older view of pushing an electric current 
through a wire against opposing forces. In particular, for a 
straight wire carrying a current, he showed that by his view the 


But 


and 
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energy would pass into the wire through its insulation at right 
angles to its length. In a second paper!® he went further and 


discussed the actual velocities of the tubes of electric and mag- 
netic flux. 


(3.3) Criticism of Poynting’s Theorem 


Soon after Poynting’s theorem had been published, it was 
shown by Sir. J. J. Thomson’ that the expression for the power- 
flow vector given by the theorem was not unique. Another arbi- 
trary vector P = curl Q could be added, since ff curl Q.ds = 0. 
Thus the local energy density was not uniquely determined by 
Poynting’s theorem. Various other forms of the vector have 
been proposed!*:!9,2° which still give the same surface integral. 
But Poynting’s vector has the great advantage of simplicity and 
has secured practically universal acceptance. Sumpner,?! how- 
ever, discussed the question of energy flow on the basis of 
Huygen’s principle and reached the conclusion that the Poynting 
vector did not describe the actual movement of energy.. Sumpner 
was confirmed in his view by Poynting’s result in his 1885 paper!8 
of a velocity for the electric fiux of nearly 60 million times the 
velocity of light. 

A more serious attack on Poynting’s theorem was made as 
early as 1902 by Macdonald.22_ He pointed out that Poynting’s 
expression for the magnetic field energy in a closed volume 
{J {4 . Bdv had been derived by Maxwell? for static magnetic 
fields only. Moreover Maxweil’s volume integration had to be 
carried out over all space. Macdonald proposed, as the correct 
power flow vector, the expression 

(0A oD) 
Ex H+3( x H+A x =) 
-where A is the vector potential or, as Macdonald terms it, the 
‘electrokinetic momentum’. Because Macdonald’s expression 
for the magnetic field energy is different from that used by 
Poynting, the surface integral of the energy-flow vector is also 
different. It is surprising that Macdonald did not point out that 
Poynting’s expression for the electric field energy was open to 
the same objections as that for the magnetic field energy. Once 
again Poynting had applied Maxwell’s expression for a static 
unbounded field3 to that of a time-varying and bounded one. 
This point also seems to have escaped the attention of later 
writers. Stratton?> discusses the question in detail and justifies 
Poynting’s treatment by postulating that Maxwell’s result could 
be applied to a bounded space if the bounding surface had not 
yet been reached by the wavefront of the disturbance. This 
argument had, however, already been shown by Macdonald to 
be invalid.23 The question naturally arises if an experiment can 
be devised to test the validity of Poynting’s theorem. Macdonald 
points out that such an experiment would have to be able to 
deal with times shorter than the periodicity of the time-varying 
currents.24 Under ordinary time-varying conditions there is no 
possibility of differentiating between Macdonald’s and Poynting’s 
theorems. Slepian,!? in a very ingenious derivation of the 
energy-flow theorem, has given nine alternative forms for the 
flow vector, showing that the choice is far wider than either 
Poynting or Macdonald supposed. 

It would appear that Poynting’s theorem fails to achieve its 
aim. It does not give the insight, hoped for by its author, into 
+he mechanism of energy flow in space. Nor is its mathematical 
»asis sound. But it does provide a simple method of calculating 
che open-circuit energy flow. However, the mental picture of 
énergy flow suggested by Poynting is perhaps best avoided in 
elementary teaching. It is possible that nature does not give 

‘n answer to Poynting’s question about the localization of 
energy. Certainly his approach would require research into the 
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behaviour of wavefronts, and such investigations at present lack 
both experimental technique and mathematical formulation. 


(3.4) Application of Poynting’s Theorem 
The mathematical formulation of Poynting’s theorem is 
beautifully simple. Very severe difficulties are, however, encoun- 
tered in the application of the theorem to an actual example of 
energy flow. We state the theorem as 


oD 


[fe xsd [f[(-43 -8- 5, 


ic. outflow of energy = (decrease of stored energy) 
— (ohmic loss) 


E.J)dv 


Consider fields that are varying harmonically; there will then be 
no net decrease of stored energy over a complete cycle. The 
statement thus becomes 


inflow of energy = ohmic loss 


and there is apparently no allowance for radiation of energy. 
This problem is best elucidated by taking an actual case. 
Consider the Hertzian oscillator, which is discussed in detail in 
the Appendix (Section 8). The charges and current of this 
oscillator are postulated. The fields of these charges and this 
current are determined in the usual manner and the Poynting 
integral then gives the well-known expression for the radiated 
energy. But in postulating charges and current nothing has 
been said as to the physical possibility of causing them to be 
disposed in the postulated manner. We realize, following our 
earlier discussion, that additional charges and currents have to 
be present because the total current flow is governed by the 
ohmic relationship E = pJ. The Poynting vector has, in fact, 
been calculated by using only the partial fields. The Poynting 
integral is here 
SJS@, x). 4s 
E,=E-E 
H, = H —H’ 


the suffix p denoting partial fields. E and H are the total fields 
and £’ and H”’ are the additional fields to make the current flow 
in the oscillator physically possible. Thus Poynting’s theorem 
applied to the partial fields becomes 


where 
and 


[|G x Has = | [| (-#,. = Ag E, . Jp)do 


Thus, for harmonically varying quantities, the Poynting integral 
now gives the product of the postulated partial current and the 
partial electric force opposing its flow, and this is the radiated 
power. It will be noticed that the Poynting method is exactly 
analogous to the back-e.m.f. method used in power devices, 
because the back-e.m.f. is also essentially a partial field, which it is 
convenient to keep separate for the purposes of computation. 

It should be noted explicitly that the partial fields are invariably 
used in calculations of the energy radiated from an aerial. The 
total fields are in general unknown and very complicated, except 
on the actual material of the conductors. However, if it is 
desired, for instance, to calculate the power input into the metal 
inside an induction furnace, the total fields must be used. In 
such a case the power input looked for is, of course, equal to the 
ohmic loss. It is clear that the Poynting integral using total 
fields must be taken around the sink and not about the source of 
the energy. A similar integral taken around an aerial or a trans- 
former winding or other source of energy would, in general, give 
the ohmic loss in the aerial or the winding, and would supply no 
information about the power radiated or transferred. 
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If Poynting’s theorem is to be used as a guide for taking 
measurements, it is clear that only the total fields can be observed. 
It is therefore desirable to investigate what information, besides 
that of ohmic loss, can be obtained in this manner. 

It should be noted that the statement of Poynting’s theorem 
has assumed that all energy is electromagnetic: chemical, thermal 
and kinetic energies have been excluded. This implies that the 
surface of integration is chosen so as to exclude all generators 
and batteries. Thus the surface must at some point intersect 
the supply leads (see Fig. 1). It has been shown that if we 
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Fig. 1.—Radiation of power from a Hertzian oscillator. 
The surface of integration for the Poynting flux vector intersects the supply leads. 
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Fig. 2.—Radiation of power from a Hertzian oscillator. 
The surface of integration for the Poynting flux vector encloses the alternator. 


calculate f (EZ x H)ds over such a surface we shall arrive at the 
ohmic loss in the conductors. Such an integral will give no 
information about radiated power. The integral [[(E, x H,)ds 
will, however, give this power. If, on the other hand, the surface 
of integration is chosen so as to enclose the alternator (see 
Fig. 2), additional electric forces due to the energy source will 
have to be taken into consideration. We can now write 
E +- E, = pJ, where E, is the additional electric force. 

This additional force may aris: from a wide variety of causes. 
Perhaps the simplest example is ‘hat of a battery, in which it is 
caused by chemical action. If, 0. the other hand, the electro- 
magnetic energy is derived from a prinae mover, such as a turbine, 
additional terms would have to be in‘roduced into the energy 
equations to account for the shaft work done on the generator. 
Such further terms are not essential to te present discussion 
and have been omitted to simplify the treatrent. 

Hence 


[fexm.a~ [ff 222-2222. 


We ey: 


Hence, for harmonic variation, 


Poynting integral = (work done on the current) — (ohmic loss) 


In such a case the Poynting integral gives the radiated power 
correctly. On the other hand, the integral using the parta] 
fields fails. This does not matter in computation because the 
partial fields still give the correct answer if the term containiny 
E, is omitted. In other words, the supply leads have to be 
lengthened to exclude the generator from the volume of integra 
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tion. It is clear, in any case, that the concept of back-e.m.f. 
would have to be abandoned if the generator were inside the 
volume. This consideration shows the dominance in the 
Poynting integral of that part of the surface which intersects 
the supply leads. 


(3.5) Use of Alternative Forms of Poynting Vector 


The simplicity of the expression E, x H, in calculation is 
such that it seems unlikely that a better form can be found. 
Hines,2° however, shows that Macdonald’s form gives somewhat 
simpler expressions in certain cases. Slepian in an early paper?’ 
favours Poynting’s expression, but more recently2° supports the 
expression V(J + 0D/d#) + H x dA/dt. This expression has 
the considerable advantage that for direct current it becomes 
VJ, ie: power = voltage x current density, and thus fits into 
the ordinary engineering framework. Moreover, it disposes of 
the somewhat repugnant notion that a static condenser and a 
permanent magnet placed near one another engage in a process 
of unending and unmeasurable exchange of energy. 

Attempts at building wattmeters capable of measuring the 
Poynting energy flow by direct measurement of the electric and 
magnetic fields are met by the extreme difficulty of measuring the 
electric force in space. At high frequencies this can be done by 
a monitor aerial embodying a thermocouple. For low fre- 
quencies, e.g. in eddy-current heating, a variant of the Poynting 
vector has been proposed to the author by Dr. R. W. Sillars, 
namely Q0B/dt, where Q is the magnetic scalar potential. Since 
this form involves measurements of magnetic quantities only, it 
is far more convenient than the various possible alternatives. 
The derivation of this vector is given in the Appendix (Section 8.1). 


(4) RADIATION RESISTANCE 
(4.1) Induction and Radiation Effects 


Poynting’s work of 1884 had given a mathematical basis to 
the Fitzgerald suggestion of open-circuit transmission of energy. 
Hertz, in 1886, provided the experimental basis for the study of 
electromagnetic radiation. His use of high-frequency oscilla- 
tions, to obtain measurable effects, gave scientists the impression 
that these were different in kind as well as in magnitude from 
the well-known effects of electromagnetic induction. Indeed, 
Hughes, who demonstrated Hertz’s effect as early as 1879 before 
the President of the Royal Society,!® was told that his effect 
was due to ordinary electromagnetic induction. This curious 
division of induction and radiation effects persists in present-day 
teaching, in spite of the fact that both induction and radiation 
effects are calculated by the identical Maxwell relationships. This 
is the cause of much puzzlement amongst students. 

The difficulty is overcome when it is pointed out that there is 
no difference in kind between those alternating currents which 
radiate energy and those used in devices like induction motors. 
Once it is found that Ampére’s equivalence of magnets and 
currents, expressed by the relationship curl H = J, is valid only 
for steady currents, it becomes apparent to the student that the 
magnetic field of alternating currents is not in phase with the 
currents. Hence the electric force always contains a component 
in antiphase to the current and there is a unidirectional energy 
flow away from the circuit. The magnitude and relative impor- 
tance of this anti-phase electric force depends on the frequency 
and it becomes dominant at high frequencies. At low frequencies 
the quadrature effect is more important. This relative impor- 
tance should not, however, obscure the fact that it is not possible 
to have pure induction fields or pure radiation fields. Con- 
ductors arranged in certain geometrical shapes may produce the 
superposition and cancellation of some of the field components, 
and this cancellation may cause the apparatus to radiate relatively 
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more strongly if the frequency is increased beyond a certain value. 
But these are special cases. The question whether a certain 
apparatus will radiate not infrequently betrays a lack of under- 
standing of the fact that any alternating current cannot help but 
send out energy by radiation. 


(4.2) Calculation of Radiation Resistance 


In determining the energy transmitted by radiation it is fre- 
quently helpful to use the term ‘radiation resistance’. By 
analogy with the ohmic resistance this is defined as the average 
power divided by the square of the r.m.s. current. 

Most writers obtain this resistance by a method of calculation 
involving fJ(E, x H,) .ds. We have shown that an alternative 
approach would be to use ff {(— E, . J,)dv and thus to apply the 
concept of back-e.m.f. to the problem of radio aerials. With 
certain exceptions, notably that of Moullin,?8 this second method 
is avoided and the repugnance to it is well expressed by Aharoni,2? 
who states that there is no such physical quantity as radiation 
resistance. Many writers feel that the flux of energy described 
by the Poynting integral gives clear physical insight, whereas the 
back-e.m.f. concept does not do so. The term ‘aperture’ is 
frequently employed in this discussion and is regarded as an 


Fig. 3.—Field of a Hertzian oscillator. 
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Fig. 4.—Field of a line current. 


opening to allow the Poynting flux to emerge. But our discussion 
strongly suggests that ‘physical reality’ can be claimed for neither 
method. The Poynting integral uses partial fields, as does the 
back-e.m.f. method. Both methods are essentially tools for the 
purposes of calculation. There is, however, much to be said 
for the back-e.m.f. method, because frequently the volume 
integral is easier to determine than the surface integral, if the 
currents are confined to paths involving a simple geometry. A 
comparison of the two methods applied to a Hertzian oscillator 
and a line current is given in the Appendix (Section 8.2). 


(5) CONCLUSION 


In such a vast subject as that of electrical engineering a teacher 
can best serve his students by attempting to lay a sound founda- 
tion of physical concept as well as of mathematical analysis. 
The subject of energy transfer is taught by means of apparently 
irreconcilable concepts, the choice being governed by the largely 

relevant considerations of the frequency employed or the 
physical size of the apparatus. In power devices the concept is 
that of back-e.m.f. and in radio devices Poynting’s theorem is 
used. It has been shown that the two methods are identical and 
elated by a simple mathematical transformation. Moreover, 
»either method is able to give any information about localized 
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distribution of energy in space, and both ignore the charge 
distributions on the conductors, which make the current flow 
possible. It is hoped that a knowledge of these matters will be 
of help to the student and will add to his enjoyment of the 
subject. 
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(8) APPENDIX 


(8.1) Derivation of an Alternative Form of Poynting’s Vector 
Suitable for Measurements at Power Frequencies 


At low frequencies the magnetic field H can be approximately 
derived froma scalar potential functionQ. Thus H ~ — grad Q. 


Therefore | | (E x H).ds = (I We E x grad Q). ds 


= I (curl QE — Q curl E). ds = {J (— Qeurl E). ds 


Hence the alternative vector is Q20B/d¢. This expression is 


not applicable to high-frequency problems. 


(8.2) Comparison of Calculation of Radiation Resistance by 
Poynting’s Theorem and the Back-E.M.F. Method 


(8.2.1) Hertzian Oscillator. 


Let the charges vary as Q cos wi. 
at a distant point P is given by 


Wr; Wro 
COs {| @t — — cos {| wt — — 
Q c E 


a Ameo ry ie) 


gis “Ee cos (wz = =) — © sin (wt — | 
47€9 [Lr Cc cr c 
The retarded vector potential at P is parallel to the current and 


is given by 
Aric EH sin (cw cee) 


The retarded scalar potential 


The magnetic and electric fields are given by the following 
relationships: 


curl A 
Ie > ie = ae 1 = 
Lo or t 
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At large distances all terms decreasing more rapidly than 1/r can 
be neglected. 
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It will be noted that these are all partial fields. 
integral is given by 


Ey = 


The Poynting 
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The radiation resistance R = ae [4 ohms. 


To achieve the same result by the back-e.m.f. method, we 
need the component of the electric force at the wire in antiphase 


to the current. 
Close to the dipole, terms in 1/r can be neglected. 


Tsin wr w ., wr 
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The component of £, in sin wt is given by 
Qlsinys 203 — Alw 19 
A4Tr€p 3 3 677c 


This is independent of r and gives the field at the dipole. 
need ys = 90°. 
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Therefore |B, te —— sin wt 
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Therefore El = — = ——Ho sin? wt 
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Hence radiation resistance R = Zao Ho ohms, as before. It will 


be noted that the second method avoids the integration and is 
slightly quicker. 


(8.2.2) Long Current Filament. 
Va—0 
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Al = Hol} Yo(— ") sin ot ~ Jo( 2%) e0s a 


where Yq and Jo are Bessel functions of order zero. 
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The Poynting integral is [{(E x H).ds, and the power com- 
ponent will be 
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The expression in brackets can be simplified?° and then 
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If / = 4A, where A is the wavelength, this expression leads to a 


radiation resistance 
TT 
R= al (2) ohms 
€o/ 4 


By the back-e.m.f. method, 


i (E x H)ds = 


E, in phase with J at the filament is given by 
Low 
4 


Ee sin wt 


Hence, radiation resistance per unit length is — = = 
c 
and R= bop per half-wavelength = sh (2)" ohms 
€9 


In this case the back-e.m.f. method is markedly quicker, 
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DISCUSSION ON THE 


Dr. H. Aspden (communicated): The point which Mr. Hammond 
makes concerning the manner in which existing applications of 
electromagnetic theory ignore the charge distributions on con- 
ductors is most important. Indeed, it raises an interesting 
question because it is not inconceivable that when such a charge 
distribution is allowed for in the calculations the result may 
well be that there is no true energy radiation from electrical 
circuits, that is, no energy lost to all the circuits in a complete 
system by irrecoverable radiation to outer space. 

Mr. Hammond points out that it was once believed that the 
energy stored in the field of an alternator was fully recoverable 
by the alternator but that this was later questioned by Fitz- 
gerald in 1882 on purely theoretical grounds. 

It is well to bear in mind that the same electromagnetic theory 
as applied by Fitzgerald in his arguments leads to the prediction 
that there is energy radiated from an accelerated electron at a 
rate given by 

OW 2ef2 


Bs eek ergs/sec . 


(A) 
where e is the charge of an electron, c is the velocity of light 
and fis the rate of acceleration of the electron, all in C.G.S. units. 

Yet, it is a known fact that an accelerated electron does not 
radiate energy. Indeed, if it did, the electrons of all atoms 
would long since have dispatched their kinetic energy to outer 
space. 

This touches on the point emphasized by Mr. Hammond 
because the derivation of eqn. (A) presupposes that an electron 
‘can be accelerated without specifying the means. However, 
when the acceleration is produced by an electric field of intensity 
E, eqn. (A) becomes 


(B) 
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where ()W/d1),. is the rate at which energy crosses a boundary 
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at radius x from the electron charge e. 
electron, this result can be written 


If m is the mass of the 


OW 4E%e7x 7 2 
le ) ou ‘) 


2 
3m2c3 \ Dx iy: ) 
because mf = eE. 

Consideration shows that this equation cannot be generally 
valid for all values of x, because for some values of x the energy 
is radiated outwards and for others inwards. This indicates 
some fundamental inadequacy of electromagnetic theory. If 
the equation holds at or outside a radius x equal to 4e?/mc? 
there is no outward radiation of energy. This is of interest 
because it so happens that 4e*/mc? is the classical radius of the 
electron assuming that it is a hollow spherical shell of charge. 
The electric field energy is 4e?/x and is equal to mc’. 

Mr. P. Hammond (in reply): Dr. Aspden seeks to apply the 
conclusions of my paper to radiation by individual electrons. 
Such extrapolation fills me with misgivings. The paper deals 
entirely with classical electromagnetic theory. The basic con- 
cept is that of electric charge, and I do not discuss the action of 
individual electrons. The difficulties in classical electron theory 
have been the subject of much investigation. Dr. Aspden is 
right in drawing attention to the necessity for including in the 
equations the force on the electron. In fact the classical force 
equations can be applied only to cases in which the external 
force is large compared with the ‘radiation reaction’ force. But 
I do not follow him when he says that accelerated electrons do 
not radiate energy. Surely this radiation has been frequently 
observed and is a serious problem in the action of such 
accelerators as the cyclotron. Reverting to my paper I should 
like to stress the fact that the existence of charge distributions, 
which I show to be necessary to current flow in conductors, does 
not and cannot throw doubt on the observed facts of electro- 
magnetic radiation. It does, however, elucidate the mechanism 
of energy transfer and shows under what circumstances it is 
possible to use Poynting’s vector as a tool in calculation. 
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SUMMARY 


Principal microwave problems encountered in the design of long- 
range communication systems using waveguides are discussed. The 
reasons for choosing as the medium of transmission the circular wave- 
guide excited in the Ho;-mode are given, and means of bridging 
engineering difficulties are analysed in some detail. 

In particular, it is shown that plain metal waveguides are not 
suitable for long-range application, unless the waveguide run is sub- 
stantially straight. Practical waveguides, however, must be able to 
follow ground contour and other unintentional bends—a condition 
under which the performance of conventional waveguides is con- 
siderably degraded. 

The problem of unintentional bends is analysed in detail using 
perturbation calculus and the concept of surface impedance. The 
formulae obtained lead to design criteria for waveguides which are 
free from the above restrictions. A number of suitable waveguides 
are introduced and the proportions are chosen with the help of charts 
given in the paper. A circular metal waveguide coated with a thin 
skin of a dielectric is a possible solution. 

Circular anisotropic waveguides are discussed at length and, among 
other designs, the helical waveguide of suitable proportions is shown 
to be another possible solution. 

Numerical examples are given to illustrate the design approach and 
to contrast the performance of optimum designs with conventional 
waveguides. These examples also show that, whereas the special 
waveguides can be bent—for example, to follow the ground contour— 
to a radius as small as 100m (or even less) without causing any appre- 
ciable deterioration in their electrical performance, the conventional 
waveguides under similar conditions would be considerably degraded, 
even to the extent of being useless for practical applications. 


LIST OF PRINCIPAL SYMBOLS 


Wo, Wy, Wo, ... = Various wave functions. 
7 = Coupling coefficient. 
6 = Angle of bend. 
6, = Critical angle of bend. 
1 = Arc length. 
S = Span length. 
R = Radius of bend. 
d = Deviation at the centre of a 10m span. 
s = Waveguide radius. 
Ho. €9 = Permeability and permittivity of free space. 
f = Frequency. 
w = 27 f= Angular frequency. 


27 
ko = wr/(Lo€0) = a = Free-space phase-change coefficient. 


Xy = Free-space wavelength. 
Zo = V (Hol€o) = Free-space impedance. 
4, €, 9 = Permeability, permittivity and conductivity 
of a medium other than free space. 
€, = Relative permittivity. 


, 1o . . peer 
Gi e(1 -- a) = e(1 —j tan 6) = Complex permittivity. 
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5 = Loss angle. 
Yr = &r + jBr = Propagation coefficient along the arc of the 
bent guide. 
y = « + jB = Propagation coefficient (axial). 
a = Attenuation coefficient. 


Bs = = Phase-change coefficient. 
rf 
hy = oa = Cut-off coefficient. 


Cc 
(A), S(y) = Various perturbation terms. 
= ho + oh = Perturbed value of Np. 
Xr, SzH, %zp = Various attenuation coefficients. 
Z, = R, + jX, = Surface impedance (normalized with respect 
to Zo). 

Z,, Zz = (Anisotropic) principal components of Z;. 

Z;, 2, = Circumferential and axial components of 
surface impedance, respectively. 

Zm, Zq = Surface impedance contribution terms due _ 
to metal backing and dielectric skin, 
respectively. 

n, = Helical co-ordinates. 
ys = Lay angle of an anisotropic surface. 
Pp = Quantity defined by eqn. (11) in the case of ~ 
isotropic waveguides and by eqn. (48) | 
for anisotropic waveguides. 


Positive time factor is understood and the wave numbers are 
connected by 
B=h+P=B%-¥ 


The quantity Zp) is absorbed in the symbol H (magnetic field 
vector) and consequently all impedances and coupling coefficients 
are normalized with respect to that quantity. As a result 
(although the rationalized M.K.S. system is preferred) the units 
do not enter in the formulae involved: the units of distance are — 
the same as those in which the wavelength is measured. 


(1) INTRODUCTION 


One of the many reasons why waveguides have not as yet 
been used for long-range communication is that in the well-— 
developed decimetric and centimetric regions the waveguides 
are too bulky, too lossy and too expensive.!»!© However, now 
that the millimetric region is being extensively explored the 
interest in waveguides for long-distance transmission has been 
revived.” 

For a long-range application, a waveguide, to be of any 
extensive use, must (apart from being an economic proposition) 
satisfy three conditions: it must have low attenuation (say 
3 dB/mile, or less); it must have adequate bandwidth-handling 
capacity (low dispersion); and it must be free from various 
multiple-echo effects associated with small discontinuities along 
the line. 

If the waveguide is operated at a frequency much above cut-off, 
large bandwidth-handling capacity is ensured3; at the same time 
low attenuation figures are obtainable, particularly if suitable 
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modes of propagation are chosen, e.g. the Hp,-mode in a hollow 
circular metal waveguide. This transmission system has the 
additional advantage of being relatively immune to imperfect 
joints between the individual waveguide sections. 

It transpires that a circular waveguide excited in the Hy,-mode 
does satisfy the above enumerated conditions and is therefore 
suitable for the application. Any other features of this trans- 
mission system are really incidental. Thus, for example, the 
indefinitely falling-off frequency-attenuation curve so often 
stressed is not really important and is an incidental feature 
common to many other transmission systems. 

From an engineering point of view, however, a transmission 
system satisfying the above-listed conditions brings with it a 
number of complications and difficulties. 

For example, to operate the waveguide at a frequency much 
above cut-off it is necessary, in order to avoid impracticably 
large waveguides, to work the system in the millimetric region, 
which is just in the course of engineering exploration. Further, 
the waveguide is capable of supporting a large number of modes 
(say 200 or more), leading at times to serious mode conversion- 
reconversion problems (e.g. in microwave component design). 

Further, the choice of the transmission mode, Ho, although 
the right one, is unfortunate in that this mode is (in a perfect 
waveguide) degenerate with the E,,;-mode. The result is that 
the Ho;-mode is unstable in a waveguide other than perfectly 
straight, leading to engineering difficulties with practical lines 
which, naturally, cannot be perfectly straight. 


(2) PRINCIPAL PROBLEMS ASSOCIATED WITH A LONG- 
RANGE WAVEGUIDE COMMUNICATION SYSTEM 

The problems peculiar to a long-range waveguide communi- 
cation system fall into three separate sections: the pipe line 
itself; the components for the pipe line; and the associated 
equipment. 

A typical long-range waveguide communication system is 
shown in Fig. 1. This comprises a large number of communi- 


Fig. 1.—Communication system employing circular waveguide excited 
in the Ho;-mode. 


cation channels feeding suitable amplifiers, whose output after 
an appropriate coding has been performed is made to modulate 
a source of microwave power, M. The microwave power is 
fed via a suitable transducer, T, into the pipe line. At suitable 
intervals along the line, repeaters, R, of appropriate construction 
are inserted. 

The line between the repeaters calls for a number of microwave 
components. The transducer is designed to transform the 
Hp,-mode in the rectangular guide into a pure Ho;-mode in the 
circular pipe with little insertion loss. Further, at many points 
along the line suitably designed bends and corners will be 
required, for example, to negotiate fixed obstacles. Needless to 
say, the pipe will necessitate a large number of suitably designed 
ioints along the whole of its run. Tapers will be employed 

‘whenever a change of waveguide dimension is required, and 
rode filters will be inserted whenever needed to combat mode 
-eonversion-reconversion effects. 

‘In the design of microwave components for the pipe line the 
ficulties are inherent to the system, arising mainly from multi- 
-p-oding effects, and there is therefore only one rule to be followed: 
‘the components must be designed with a low mode-conversion 


figure. The attenuation of the components as far as the Hp;-wave 
is concerned is of secondary importance, since the components 
generally occur at relatively infrequent intervals and consequently 
add inappreciably to the total line loss. 

The design criteria for the waveguide line itself, however, 
cannot be summarized in a few sentences, because of the large 
number of factors involved. More particularly, the signal 
distortion in a long-range waveguide transmission system as 
caused by the waveguide is due to several different phenomena: 
the attenuation characteristic, mode conversion-reconversion 
effects, multiple-echo effects, and dispersion. 

If the waveguide is employed for transmission of pulse-code- 
modulation (p.c.m.) information only, it can be shown? that the 
distortion of the intelligence (basically governed by the pulse 
distortion) as a result of attenuation is negligible in comparison 
with dispersion effects. Consequently, by using suitable coding 
(p.c.m.), the first phenomenon is rendered unimportant. 

Mode conversion-reconversion and multiple-echo effects, 
i.e. wave scattering at minute irregularities along the line, have 
been a subject of a limited study.4;° In general, the better the 
waveguide tolerances (other factors being equal) the less 
important these phenomena become. 

With lines of good tolerances the only mode conversion- 
reconversion effects of practical significance are those between 
the Ho,;- and E,,-modes, as brought about by a bend in the 
waveguide. 


(3) THE PROBLEM OF BENDS 


(3.1) General 


Communication via a waveguide excited in the Hog,;-mode 
(however low its theoretical attenuation may be) cannot be a 
practical engineering proposition unless and until a satisfactory 
solution to the problem of bends is produced. A waveguide 
used for communication between two points will necessitate two 
types of bends: the intentional bend and the unintentional bend. 
The first type arises when one wishes to make a deliberate and 
sharp change of direction of the propagation (e.g. to negotiate a 
fixed obstacle). The design of such a bend does not bring any 
formidable problems; even a substantial increase in attenuation 
in the bend is tolerable, because the overall performance of the 
system is little affected by it. 

The unintentional bend, on the other hand, is a different 
proposition: it may arise anywhere along the whole of the wave- 
guide run, and is unavoidable for practical reasons because it 
it is desirable for the waveguide to follow the contour of the 
ground (e.g. when laid in a trench), and in addition there is a 
limit to the precision with which a waveguide can be laid to 
approximate to a straight run. 

The two problems are distinct, but any modification of the 
waveguide which removes the degeneracy between the Ho,- and 
E,;-modes is a step in the right direction. However, whereas 
any such modification (which at the same time substantially 
increases the waveguide attenuation) may prove satisfactory for 
intentional bends, it is of no advantage for unintentional bends, 
since over a long run the waveguide would be considerably 
degraded in its performance. An example is a waveguide made 
of a poor conductor, which eases the problem of bends but 
degrades attenuation; elliptical waveguide is another example. 


(3.2) Review of Published Theoretical Approaches 


The propagation of a Ho,-wave in a curved circular waveguide 
was first investigated by Jouguet.? He analysed the field in the 
curved section of a circular waveguide interposed between two 
semi-infinite pieces of straight pipe of the same cross-section 
(see Fig. 2 overleaf). 
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PURE Ho, ~ MODE 


Fig. 2.—Curved circular waveguide. 


If the radius of curvature is not too small in comparison with 
the wavelength A and the pipe diameter 2s, the field in the curved 
region is obtained (substantially) by a linear combination of 
Ho)- and E,;-modes. The E,,-mode in question is polarized as 
shown in Fig. 3 and will be denoted by Ej}. 


PLANE OF 
BEND 


Fig. 3.—Degenerate modes of the circular waveguide. 
Lines of electric field. 


—-—-— Lines of magnetic field. 


The actual analysis of the problem, although straightforward, 
is rather lengthy, and only the results that have bearing on our 
further investigation will be discussed. 

The simplest case is that of a loss-free pipe. 
in the curved region is given by* 


pb, = A[H%o, cos (BRI) + jr/(DE7; sin(SBpl ei! — (A) 


where 4 is an arbitrary constant and dfr is given by 


Here the field 


Eqn. (1) has been plotted as a function of the bend angle 6 
in Fig. 4. It transpires that the nature of the field at a point in 


D STRENGTH 


re ANGLE OF BEND 0 


Fig. 4.—Field strength of the Ho;- and E7}-waves as a function of 0: 
DS ie 


the curved region is determined by @ rather than by the arc length. 
In particular, the field is always a mixture of the Hy,- and Ej,- 
modes, except at the points given by 


6 =n, (nodd) cor eieae Bh) 


_* Ho and &1, are algebraic symbols for the two different fields (Ho; and Ej;). This 
distinction is necessary, since the two terms in the brackets are not directly additive. 
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where the field is a pure E/,-wave, and at the points given by 
6 = m0, (meven) <i toihtaeeaeenieae 


where the field is a pure Hg;-wave. byes 
The angle 0. is called the critical angle and is given by 


os hodo 


c 4/2 


or a, 15548 degrees 07), Ml ne Uae 


7) radians... 4s, see 


Since, after a bend angle of 20, has been traversed by the 
wave a pure Hp,-wave is re-established, this property could be 
turned to advantage when designing intentional bends. However, 
as can be seen from eqn. (6), 6, depends on frequency and 
consequently a device utilizing properties of 0, is essentially a 
narrow-band device. 

Jouguet applied his analysis further to the investigation of 
propagation in metal pipes of finite conductivity. It transpires 
from his results that, for a pipe of approximately 3in diameter 
operated at a frequency of 35Gc/s, bending radii of the order 
of several miles already add substantially to the effective wave- 
guide attenuation. 

A different analytical approach was proposed by Albersheim.® 
It is based on the theory of coupled transmission lines. The 
coupled transmission line equivalent of a bent circular guide is 
shown in Fig. 5. The lines Ly and L; can carry power in Ho,- 


Fig. 5.—Transmission-line equivalent of a bent circular waveguide. 


and Ejj-modes, respectively, and there is a uniform coupling 
between the lines to simulate the effect of the bend. The power ~ 
is fed into the Lp-line and the actual proportion of power at the 
end of the line depends on the coupling coefficient as well as 
on the length of the line. Quantitative results are obtained by 
the application of circuit theory and can be shown to be in 
agreement with those obtained by Jouguet. This approach, 
though less rigorous, appears to be more versatile. 


(3.3) A New Approach 


The mathematical rigour of Jouguet’s analysis and the ver- 
satility of the equivalent coupled-transmission-line approach, | 
together with many additional advantages, are all contained in | 
the surface-impedance approach,? elucidated by the author some | 
time ago.!° 

The problem is formulated as follows. It is required to find 
an expression for the electromagnetic field inside a curved section 
of circular waveguide (Fig. 2) interposed between two semi- 
infinite straight pieces of waveguide of the same dimensions as 
the curved piece of guide, and the field at the entry to the wave- 
guide is assumed to be a pure Hp;-wave. The guide surface is — 
imagined to exhibit a surface impedance of the value Z,, and the | 
field components are then made to assume such values at the 
guide surface as to satisfy that condition. It is, in fact, a problem 
of wave propagation inside a curved cylinder of circular cross- | 
section whose surface impedance is Z,. It is assumed throughout 
that the absolute value of the surface impedance is a small — 
quantity. 

In the solution of the problem Z, appears as one of the para- 
meters. If, at this stage, all conditions to be satisfied by the 


KARBOWIAK: MICROWAVE ASPECTS OF WAVEGUIDES FOR LONG-DISTANCE TRANSMISSION 363 


waveguide (e.g. certain minimum allowable mode-conversion 
loss due to the bend) are inserted in the appropriate formulae, 
an equation determining the required value of Z, is obtained. 
The second part of the problem is then of a practical nature, 
namely to realize physically a surface giving rise to that value of 


surface impedance, and this, as will be seen, can be done in a 
number of ways. 


(4) INFLUENCE OF SURFACE IMPEDANCE ON THE PER- 
FORMANCE OF CURVED CIRCULAR WAVEGUIDE 


(4.1) Extension of Jouguet’s Analysis 


In the surface-impedance approach the analysis is—after an 
allowance has been made for minor changes in notation—the 
same as given in Reference 7 up to and including eqn. (51) 
therein. Eqn. (52) et seq. are, however, to be replaced by the 
equations given below. 

In these formulae all impedances are, unless otherwise stated, 
normalized with respect to the free-space impedance, iy. 
Further, if we let Cy and C, be the amplitude coefficients of the 
Ho,- and E;,;-waves, respectively, then the coupling coefficient 7 


is defined by* 
me Si 
ae 


E (7) 
(4.1.1) Propagation in Homogeneous and Isotropic Waveguides. 


In homogeneous and isotropic waveguides the guide wall is 
characterized by a surface impedance, Z,, such that 


Es Beebe cet | s (8) 
a Zz 
Z| ; ze | 


Using the equations appearing in Sections 3 and 4 of 
Reference 7 and inserting the appropriate values in the eqn. (8) 
above, the following will be obtained [the procedure is similar 
to the one used in obtaining eqn. (52) of Reference 7]: 


Aj ké8s Ko at i et iE Bs _ ho I 
J “ae ela day |e thR ko Jolhs) ©) 


Since h = hy + Oh and dh is a small number, and at the same 
time fp is the first root of J,(hs) = 0, consequently, on expanding 
the Bessel functions occurring in eqn. (9) in Taylor’s series 
about the point js and retaining the first term of each expansion 
only, we arrive at the following quadratic for 7: 


tee yp ee ee Ee aU) 
h Re 
where Q= Bis Tot nee” Seat (LL) 
ony 
Eqn. (10), of course, has for its solution 
To ace (De tee oe | an E2) 


There are thus two solutions for 7 and correspondingly two 
solutions for h and y. In other words, the Ho;- and E,;-waves 
are unstable in the curved region, and the two independent and 
stable solutions are 


by = A(%o + CE et 


i (13) 
ib, = A Ho, + 728 ye 


Evidently the nature of the solution depends on the magnitude 


of p. 
If p is small (p < 1)—and this occurs for sufficiently small 


* + has been used instead of K, as in Jouguet’s analysis. 
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values of surface impedance in relation to the radius of curvature 
—then 


Tak Tony V2 oy Dd ations, Saul 


The corresponding value of d(h) is obtained on substitution 
of expression (14) into eqn. (9). Thus, to the order indicated, 


2hoko —_ h?Ry/2 ; ‘ 


The first term in eqn. (15) is the mean of perturbation terms 
(due to Zs)® as would be obtained for a straight pipe when 
carrying the Ho,-wave [term d(A)zy] and the E,,-wave [term 
d(h)ze]. The second term is the effect of curvature [term 5(/),]. 
Thus 


dh), d(A)2 


3(0);,8(H), = Wen + ze 5H, . , 06 

3) send 
and? Sy), 8()2 = (zn 5 (ze ~ jE a) 
where (Pp = me sty gic ethan CLS) 


Here S(y)zy and S(y)z x are the perturbation terms in a straight 
pipe (due to Z,) when carrying a pure Hg,;-wave and a pure 
E,-wave, respectively.” 

For Z, = 0 these expressions reduce to the corresponding 
expression given in Section 3-2. As a particular application of 
these equations when Z, ~ 0, consider the case of a metal pipe 
of conductivity o. In this case the surface impedance is given by? 


Z,=0+afCB)\Z=R4i% - . 9) 


and for this value of surface impedance the equivalence of the 
above equations and those obtained by Jouguet is easy to 
demonstrate. 

To understand the nature of the wave in a curved waveguide 
(Fig. 2), let us determine the coefficients A; and A, in eqn. (13). 
Let the wave for 0 = 0 be a pure Ho -wave, given by 


bb = Aye? ca eee tte een (20) 
At 0 = 0 we equate eqns. (13) and (20), obtaining 


72 


airs 17 
(21) 


, ai 
and A> sail sete Lill eh i 
igen 


Consequently the field in the curved guide is given by a 
mixture of Ho, (¢bz) and Ej,(_) waves, and hence 


by = se s em ad 760) 
(22) 


Tig Ta 


and be = (e—Y2! — g—%1!) 


Lis ae etl 
For small p we get for the total field 


th = thy + bp ={M cos [8(B)] +IV/(DE{, sin [B(B) al] fe" 
A (23) 


) ) 
where vo = j8 4 eS ()ze (24) 
The division of energy between the Ho,- and E,;-waves is 
thus as shown in Fig. 4, but in addition the whole waye com- 
13 
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bination suffers an attenuation which is an arithmetic average 
of that for Ho;- and E,,-waves; similarly the phase velocity of 
this wave combination, ys, is the mean of phase velocities for the 
Ho;- and E,;-waves. We note, further, that to the first approxi- 
mation (p very small) the surface impedance has no influence on 
the relative phase velocities of the component waves (Ho; and 
E,,), and therefore it does not remove the degeneracy between 
the two waves concerned. 

If p is not sufficiently small, eqn. (14) does not represent 
eqn. (12) adequately and consequently eqn. (23) is not applicable. 
Inasmuch as the case of moderate p could be dealt with exactly 
by using the values for 7 as given by eqn. (12), the case is not of 
great practical interest. Accordingly, from now on we shall 
concern ourselves with cases for which p is a large number (say, 
in excess of 10), which is of great importance in that it has a 
direct bearing on the design of waveguides for long-distance 
transmission. 

If p is large, then from eqn. (12) we get 


1 
Us Up SC ae é), 2p(1 ar é) (25) 
where € is the remainder of the series whose first term is 
1 
ers 772 (26) 


2p 


and this, unless otherwise stated, will be neglected. 
Substitution of the coupling coefficients as given by eqn. (25) 
into eqn. (12) gives 


bh (A + o(/ 
Bad = | ee ia eae 
2 ZE 
ih 1 hes OO HE NZ 
h S(A)p =i : : 28 
Mg r= J ZR he Ge oe 
Lh 
and d(A)zy = e i Zs) 
1 . ae 
25 Ayo 
S(A)ze = : p> 


In these expressions the various perturbation terms will be 
recognized as: d(h)zz, the perturbation term of a straight pipe 
due to Z,, when carrying a pure Ho;-wave; d(h)z,, the pertur- 
bation term of a straight pipe due to Z,, when carrying a pure 
E,,-wave; and d(h)p, the perturbation term which is a combined 
effect due to Z, and guide curvature and has meaning for R > ©. 
(In virtue of the approximations previously made 8(h)z has no 
significance for either Z, > 0 or R > 0.) 

The perturbation terms of the axial propagation coefficient 
corresponding to eqn. (27) are) 


fal 2 eae + nel 
d(y)2 d(y)ze 


where O(y)zy and 6(y)zgz have meaning as explained previously, 
and are given by 


(30) 


ZH s koBo Ss a 
lh 
r) —— ed LY) 
(y)ze s Bo~® 
while the perturbation term due to curvature is given by 
—_ 1 ss kB ko\?R, — jX, 
SYR = aR R Be (ze) AR (62) 
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(4.1.2) Nature of the Wave in a Bent Circular Waveguide. 
To examine the characteristics of the field inside a curved 
circular guide we use eqns. (25) and (30) in eqns. (22), obtaining 


by ={G — + € exp [8(y)2 — S(y),]/} exp (— yl) (33) 


and 


yee afexp [— 8(y)o1] — exp [— 8(),1]} exp (—/81) 34) 


It will be recognized that eqn. (33) represents a ‘pure’ Ho;-wave 
whose amplitude, apart from suffering an attenuation, varies in 
an oscillatory manner about the mean value (1 — é). But, 
since p was assumed to be large (say, over 10), €[ = 1/(2p)] is 
a very small quantity, and consequently these variations in 
amplitude are negligibly small. Eqn. (34) represents an E,,;-wave 
whose amplitude (of the order of 1/p) varies in a periodic manner, 
apart from suffering the usual axial attenuation. For R—© 
the quantity 1/p > 0 and € > 0 and consequently #4,—> 0, while 
the amplitude of the Ho;-wave becomes unity, thus giving the 
solution for a straight pipe. The amplitude of the Ho,;-wave as 
a function of arc length is shown plotted in Fig. 6. The distinct 
nature of the field in a curved guide for the two extreme cases 
[of p large (Fig. 6) and p small (Fig. 4)] cannot be over-empha- 


FIELD STRENGTH 


ARC LENGTH 


Fig. 6.—The amplitude of the Hoi-wave in a circular waveguide: 
p>l. 


sized, and it is the case of large p that will occupy our attention 
from now on. 

We have shown above that, provided that p is a large number, 
the wave in a curved guide will be a substantially pure Ho,-wave. 
This, however, will have a propagation coefficient, y,, given by 


¥1 = JB + 8) (35) 


Thus, the attention of the Hp,-wave in the curved guide, under 
these conditions, is increased by the amount 


ar = Ald(y)z] 
as 1 s ko 2kB Rs 
2R RAB, he |Z,|? 


(36) 


1.e. XR 


The attenuation of a straight guide when supporting the 
Ho;-mode is given by? [cf. eqn. (31)] 


Sree nig 
22 ok Bite (37) 
Let us introduce a ‘quality factor’ of a bent guide defined by 


Ve ae ee 


a al ia NE ' 

2LR\Io/ |Z,| 
This quantity, expressed as a percentage of azy, is shown 
plotted in Fig. 7 as a function of Ap with |Z,|R as a parameter, 


We then find that 


(39) 


a r 


Eee eee 
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Fig. 7.—Chart for the computation of the quality factor, v, of a curved 
circular waveguide of 7cm diameter. 


the curves referring to a guide whose diameter is 7cm. Thus, 

_ using this chart we find that to negotiate a bend at Ay = 8-Omm 
and v = 10% we must have the product |Z,|R = 1600 ohm-m. 
Consequently, a bend of 800m radius is possible provided that 
|Z,| > 2 ohms. 

Eqn. (39) is significant in that it shows that for a given quality 
factor v the maximum permissible curvature is proportional to 
the absolute value of the surface impedance. It thus transpires 
that a large value of surface reactance definitely improves the 
performance of the bent waveguide. 


(4.1.3) An Application. 

The important point, yet to be discussed, is how to provide 
for the necessary surface impedance. This can be done in a 
number of ways, but in this Section we shall concern ourselves 
with one method only, namely coating the inner surface of the 
guide with a thin layer of a suitable dielectric. 

It can be shown that coating a metal] surface with a thin layer 
of dielectric is an effective means of enhancing the surface 
reactance,” !! without appreciably affecting the surface resistance. 
The resulting surface impedance is with respect to an E-wave; 
that of a coated surface with respect to an H-wave is different.? 

This at first gives rise to a difficulty, which can, however, be 
circumvented by the application of an analysis akin to that 
indicated in Section 4.2. But, to keep the argument as simple 
as possible for the purpose of explaining the modus operandi, it 
is permissible to use the expression for the surface impedance 
as derived for E-waves, leading to results which are a good 
approximation, particularly in the case of waveguides operated 
at a frequency much above cut-off. 

The surface impedance of a metal surface coated with a thin 
layer (thickness f) of a dielectric (of relative permittivity «, and 
loss factor tan 5) with respect to an E-wave is? 


Z,=Zy+Za (40) 
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where Z,, is the surface impedance of the metal surface in the 
absence of coating, namely 


Zm = (1 + Dal (ZB) F (41) 
and? 10 . 
Za= itkao| (@) — (| (42) 
1 0 


For waveguides of large diameter, the second term in the 
square brackets of eqn. (42) is small in comparison with the 
first and may accordingly be omitted; this results in considerable 
simplification of all subsequent formulae. Thus, for example, 
with a 7cm diameter guide operated at 8-7mm the error in Zz 
on that account is of the order of 2°%, which is entirely satis- 


factory for any applications that we may contemplate. With 
this approximation in mind we have 
Za = Ra t+ jiXg 
tan 5 (43) 
=kpo = t+ jky(1— 2) 
é, é, 
Consequently 
R, = R,, + Rg 
diel . ti tan 6 
<ZN Ga) a! a 
and D. Ci, Cn oe E, 
a ‘i TfL 1 
=ZN (7) + h(t a) - 
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Fig. 8.—Surfacefreactance Xqzof a dielectric-coated surface 
per 0-001 in thickness of coating. 


Fig. 8 shows the variation of X, as a function of frequency with 
€, aS parameter. The quantity X,, = R,, is plotted in Fig. 9 for 
three different metals: copper, aluminium and brass. 

For any practical application it is important for the attenuation 
of a straight waveguide to remain approximately the same; i.e. 
it is required that R, shall be small (Rz < Xz). Whether this 
condition will be fulfilled or not depends entirely on the relative 
values of the permittivity and the dielectric loss. The ratio Rg/Xq 
is given by 


(46) 


As an application, consider the following numerical example. 
What must be the coating thickness of a uniform layer of poly- 
thene applied to the inside surface of a copper tube to enable the 
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Fig. 9.—Surface impedance of metal surfaces. 


guide to follow a bend, whose radius is 400m, with an increase 
in attenuation not in excess of 10%? 

Using the data contained in Figs. 7-9 we find that if the tube 
is provided with a 0-001 in thick layer of polythene (e, = 2-2, 
tan 6 = 0-0007), the above requirements will be fulfilled. The 
tremendous improvement thereby achieved can be seen by con- 
trasting the performance of this guide with that of a plain copper 
guide: in the latter case a bending radius of less than 20km 
could not be tolerated. Fig. 10 shows the relation between the 
thickness of coating and the radius of bend for three values of v. 
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Fig. 10.—Allowable radius of bend for a given quality factor » as a 
function of thickness ¢ of polythene (¢, = 2:2) coating. 


Ay = 8-7mm. 
2s = Guide diameter = 7cm. 


Let us view these results in yet a different light. It is not 
possible to install a waveguide line of any appreciable length 
in an optically straight path; for a number of reasons, the line 
will deviate from its straight course in a rather random way. 
In Fig. 11, the deviation in a span S is denoted by d. If d is 
small, our previous results can be applied, where R is the radius 
of curvature of the unintentional bend. We then find that, in 
the above example, if S = 10m, for a plain copper pipe d must 
not exceed +0°6mm, while in the case of the coated pipe the 
allowable limits of d are +3cm. It is thus evident that while a 
pipe cannot be laid to a precision of +0-6mm, the tolerances 
of +3cm are ample. 

In the last example eqn. (46) shows a 1% increase in the value 
of surface resistance, leading to 1% increase in attenuation of 
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Fig. 11.—The unintentional bend. 


the E-wave in a straight pipe, and if the surface impedance for 
the H-wave were the same, as assumed above, the attenuation 
of the H-wave would increase by the same amount: in actual 
fact this is not so and the attenuation of the H-wave increases 
by much less than this figure. 


(4.2) Performance of Anisotropic Waveguides 


(4.2.1) General. 

A surface is called anisotrupic when its surface impedance is, 
among other factors, dependent on the orientation of the surface 
with respect to the field components, and a waveguide whose 
surface is anisotropic is termed an anisotropic waveguide. The 
theory of anisotropic waveguides has been a subject of separate 
treatises, and to avoid needless repetition reference to them will 
be assumed.” !3 

The boundary conditions to be imposed in the case of aniso- 
tropic waveguides are 


Ee 
fi),=% 
(47) 
ISU 
and - | = , 


Where Z4 and Z, are the circumferential and axial components 
of the surface impedance, respectively. 

The boundary value problem is solved in the manner indicated 
in Section 4.1.1 leading to eqn. (9), with p given by 


P= 3° 12x) Sonne nen 


where 


(49) 


Now, it has been shown elsewhere? that the propagation of 
E-waves is influenced by the axial component of surface impe- 
dance, Z,, only, but that the propagation of Hpo-waves is 
unaffected by this impedance; the latter is influenced by the 
circumferential component of surface impedance, Z4, only. 
Consequently, in eqn. (49) we make the following identifications: 


Zz = Zz E 
Zz Z } (50) 
23 = L7H 
For large-diameter pipes |h| < k and consequently (since we 
are interested only in cases where Z, is of the order of 2,08 
less) we have substantially 
ZR~ Z,= LzE (51) 
The wave in an anisotropic waveguide is of the same nature 
as that in an isotropic one and is given by eqn. (13). The 
coupling coefficients are as given by eqn. (12), but with p as 
given by eqn. (48). Again, as with isotropic waveguides, the 
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case of large p is of particular interest. The perturbation terms 
of the wave propagation coefficient are given by 


ine Ps hee a a 
d(y)2 S(Y)ze 


(52) 

where? 
30)2n = 20 I (53) 
y)ze = =: : 4 (54) 


It will be recognized that 5(y)z,,; and 8(y)zz are perturbation 
terms due to surface impedance (in the absence of curvature) for 
a guide carrying a pure Ho;-wave and a pure E,,-wave, respec- 
tively. 

Further, the quality factor, v, of a bent guide is given by 

OR 


“ZH 

se ko\? 1 f Re 

2|R 7) |\Z,|_] Re 
and consequently the chart of Fig. 7 can be used for computations 
on anisotropic guides provided that allowance is made for the 
factor R,/ Rg. 

At this stage it is instructive to examine eqn. (56) in greater 
detail. In particular, it will be observed that the quality factor 
for an anisotropic waveguide depends on the real value of Z, 
’ in addition to its absolute value—in contrast to the isotropic 
waveguide [eqn. (39)]. 

There are two cases of practical interest: (a) if Z, is made 
predominantly resistive (|Z,| ~ R,)—this is a case of selective 
attenuation—then for a given quality factor v, the maximum 
permissible curvature will be found proportional to »/|Z,|; 
(b) if, on the other hand, Z, is made predominantly reactive— 
this is a case of reactive loading—then the curvature will be 
- simply proportional to |Z,] (i.e. X;). 

It thus transpires that there are two distinct types of anisotropic 
waveguides suitable for our purposes: (a) a waveguide in which 
a deliberately large amount of attenuation is introduced for the 
E-waves, leaving the Ho;-mode unaffected, and (6) a waveguide 
with reactive loading of the E,,-wave in order to break up the 
degeneracy between the E,,- and Ho;-waves. But the choice 
of the waveguide for any practical application is mainly a 
matter of engineering convenience. 


(56) 


(4.2.2) An Application. 

An anisotropic surface is readily produced. In fact, any 
surface suitably treated, so as to result in a periodic structure 
of the surface in a particular dimension only, gives rise to an 
anisotropic surface impedance, and a corrugated surface is just 
one of many possible. 

As a numerical example consider the following problem.!> 
What is the improvement achieved in terms of minimum allow- 
able radius of bend of a 7cm diameter copper pipe, operated at 
\y = 8-7mm, which has been provided with circumferential 
corrugations, as shown in Fig. 12? Take the dimensions of corru- 
gations to be: t; = 0:05mm, ft, = 0-5mm, 1=0:2mm. Because 
of the particular proportions chosen for the corrugations in 
relation to the wavelength, it is permissible to proceed with a 
number of approximations” 14 Tn particular, we can use eqns. 
(10) and (11) of Reference 6 and we can assume that the field in 
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Fig. 12.—Longitudinal section of corrugated pipe. 


the corrugations is substantially in the lowest mode of the so- 
formed short-circuited piece of the radial transmission line 
(length /). We then find that 


ZzEZo => Z,Zo = 0-083 + j4-95 ohms } 


37 
ZznZ0 = Z3Zo == (0057/5) el + j) ohms ( ) 


When these values are inserted in the relevant formulae we 
find that a bending radius as small as 400m, at the expense of a 
10% increase in attenuation due to the bend, is possible. Since 
attenuation of the pipe itself, when straight, is 10% higher, 
there is therefore a total 20% degradation. This can be con- 
trasted with the performance of a plain copper pipe, which for a 
20% increase in attenuation does not permit bending radii 
smaller than 14km. It will be noticed, however, that there is a 
measurable degradation in the performance of the straight wave- 
guide (10%), which was not the case with the dielectric coated 
pipe, and it is probably safe to say that all practical forms of 
anisotropic waveguides are somewhat inferior in their per- 
formance in straight runs, but that the degradation can usually 
be brought to a tolerable figure (say 10°%). 


(4.2.3) Waveguides of Helical Anisotropy. 


Although a corrugated waveguide, as described above, 
satisfies the need adequately, the structure is not easy to manu- 
facture. A more acceptable form of waveguide is, for example, 
in the form of a tightly wound helix (of very short pitch) made ofa 


¢ 
(b) 


Fig. 13.—The helically anisotropic waveguide. 


conducting wire with adjacent turns touching (Fig. 13). Before 
such a guide is put into commission it is necessary to ascertain 
whether the wave modes involved (E,;; and Ho;) are stable in 
such structures. This can be shown to be the case,!? although 


i i lindrical casing containing a helically wound (copper) wire. 
circumstances higher-order H-modes are unstable. Cy ng 
Te ee Sates will he our concern. ; For some applications the turns are in mutual contact and for 
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The principal components of the surface impedance (Fig. 13) others the wire may be insulated. Another possible con- | 


: t 13, 15 struction!5 is as shown in Fig. 14(6); here the helix is embedded 
Bea: ee teeta Ween in a suitable medium, e.g. a dielectric. Further, instead of round 
Zs = (Zn + Ze tan? P)/(1 + tan? yf) | (58) Wire a conducting tape may be used, while for other applications | 
and Z, = (Ze + Z, tan? J)/(1 + tan? w) i it an anodized aluminium wire gives excellent performance. From | 
, ae jal n 


the engineering point of view an important feature shared by | 
The performance of waveguides of helical anisotropy (h.a.) many h.a. waveguides is that they can be made in long length | 
when bent can be thus predicted from the formulae given in by a continuous winding process. 
Section 4.2.1 with Z, and Z, given by eqns. Se ‘ " 
In particular, if the lay angle of the helix, , 1s small, as 1 ep ce i 
would be in any practical design, eqns. (58) can be approximated (5) CON ' 


b An outline of a long-distance communication system employing i 
‘ Zee ZanZe we waveguides has been presented and various problems peculiar f 
roe : 2} (59) to the system have been briefly discussed. The microwave 
Z,~= Z,+ Z,P aspect has been dealt in great detail and in particular the wave- 
Further, since for any practical design we would have guide—the medium of communication—has been closely 
|Z-| > Z, examined. eee! 
: ; J eZ, Problems associated with the waveguide line have been con- 
ceaanGs 2 (60) sidered individually and the inadequacy of plain metal wave- 
Zy eZ + Zep guides for long-distance application has been stressed. With a 


as if Z and iz components were £1 en by eq. (60) This J tr . - has b cen. de elop C d and 
Zz p V Nn. . 


ion i ; : i f such Titeria for waveguide design have been formulated. 

Saran See ea a Seatracl Lae caaea a oa Finally, practical engineering solutions to the problem have 
ee oe many significant points contained in eqn. (60). In been produced and a variety of suitable waveguide designs dis- 
the first place it will be observed that for a helical waveguide, in cussed. The CONICET HUE att ae us  Wayeatiey Pers 
contrast to waveguides discussed previously, the attenuation eae thereby achieved has been illustrated by 

is proportional to R, + Ry ¥, and consequently, other factors ©X4mpies. 
being equal, such a waveguide is inferior by ate factor Rep (6) ACKNOWLEDGMENTS 
Since, for practical waveguides, ¥* would be 10~4 or even 10~°, , ; 
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Ref? would be negligibly small. With a selective attenuation communication Laboratories Ltd., for a number of interesting 
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THE DETERMINATION OF CONTROL SYSTEM CHARACTERISTICS FROM A 
TRANSIENT RESPONSE 


By J. P. ELLINGTON, B.Sc., A.M.I.Mech.E., and H. McCALLION, B.Sc., Ph.D., A.M.I.Mech.E. 


(The paper was first received 7th May, and in re 


SUMMARY 


Measured values of the output of a control system are used to form 
a characteristic equation from which the overall transfer function 
may be found. To use the method it is necessary to know, or assume, 
the form of the transfer function, and also to have a knowledge of 
the input form. In this manner parameters of existing systems may 
be found either with a view to improving the system or providing data 
for future designs. A worked example indicates the possible order of 
accuracy. 


LIST OF PRINCIPAL SYMBOLS 


6;, 8, = Control system input and output. 
x(t) = A measure of the system output. 
Xo, Xj, Xp, etc. = Measured values of x(t) at time intervals 0, 7, 
ELS UE 
Pp; = Characteristic exponential exponent. 
A; = Coefficient of exp (p;t) in expressions for x(f). 
Z; = Value of exp (p;T). 
Xj; = Normalized values of x, x,, etc., from least- 
squares equation. 


(1) INTRODUCTION 


In the operation and analysis of existing control systems, one 
of the most important requirements is a knowledge of the 
relationships between input and output. This relationship, the 
overall transfer function, which is of fundamental importance, 
may then be used for a number of purposes. The response to 
possible variations of input or external disturbances can be fore- 
cast, and further, if this transfer function is known in an 
analytical form, as opposed to a graphical form, the effect of 
changes in the system’s parameters can be examined with a view 
to making improvements, without disturbing or interfering with 
a system already in operation. In addition, the determination 
and knowledge of the parameters of existing systems could be 
of great use in the design and synthesis of future systems. 

One widely used method of analysis is to determine the fre- 
quency response of the system, or the steady-state output and 
phase difference due to a sinusoidal input. Apart from experi- 
mental difficulties, the time required for full analysis may be 
uneconomic or prohibitive and the information obtained is 
often of a limited nature. The interpretation of the results may 
also prove difficult, although approximate methods have been 
derived! whereby transient responses may be estimated from the 
frequency response. 

A rather more fruitful method is to excite the system by means 
of a pulse or step input and to observe the transient output. This 
transient may then be analysed by methods such as the Fourier 
transform or Laplace transform to give the transfer function, 
but these methods are not suitable for systems which may be 
subject to uncontrolled disturbances, since they demand inte- 
gration of the transient over an interval from zero to infinity. 
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Shinbrot? gives a résumé of these methods in connection with 
the motion of aircraft, and also gives a variation of the Fourier 
transform method for use with observations over a limited time 
range. However, for systems composed of several components 
these methods usually require an excessive amount of numerical 
labour. 

The authors’ intention is now to restate a method of com- 
pletely determining the parameters of a linear system from 
observations of the transient output produced by a known 
impulsive or step input, the method allowing the system’s 
normal modes and transfer function to be found. 

The determinantal form of the characteristic equation given 
below was first quoted by Sir. J. J. Thomson? in connection with 
the determination of X-ray absorption coefficients, and although 
he pointed out dynamical applications, his work does not seem 
to be generally known. This original equation, however, 
demanded that measurements should be exact and free from 
errors and noise, and in looking for suitable modifications the 
authors found that Thomson’s work was a special case of a 
method of Prony* for interpolation by means of exponential 
functions. This latter method has been briefly mentioned by 
Shinbrot, but despite the necessity for adequate means of 
analysis it still seems to remain unnoticed. In view of this a 
new proof of the method suitable for control system work has 
been prepared, the results being presented in a manner suitable 
for computation. 


(2) DERIVATION 


Considering now an actual control system subjected to only 
one input, it is assumed that mathematical representations of 
the various system elements are known, the problem being to 
determine the physical constants involved. In general, if these 
elements are linear, the relationship between input and output, 
6, and 6,, is given by the differential equation, 


d” q’-1 
(ane Te Serres Geo a a9) 


dm qm-1 
om Ce + by ag tee + bo)® . () 
where n > m. 
Taking the system to be quiescent at time t = 0, the Laplace 
transforms of each side of eqn. (1) may be equated to give 


bnp™ + by, 1p"! +... £6 
0 —_ m m—1 °) 
AP) ee yep" Ue eee 


= Y(p)6(p) 


The overall transfer function Y(p), may then be resolved into 
partial fractions to give 


G C C, 
O,(p) = : ah 2 SP ews £ 
Car Lb Sal O53 hts melon 


9i(p) 
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The n roots of the characteristic equation, Pj, Py... Dn, May 
be real, imaginary or complex, it being assumed that no two 
roots are equal. 

The actual form of the output 6,(t) obviously depends upon 
the type of disturbance which can be applied to the system. For 
instance, if a unit impulse can be applied then 6(p) = 1 and 


O,(t) = C, exp(p,t) + Cy exp (pot) +... + Cr exp (prt) (3) 


If, however, a unit step input, 0,(p) = 1/p, is used, it is con- 
venient to refer measurements of output to the steady-state 
output, 6,, = bo/ap, and write 


8,(t) — O45 = (Cy/P1) exp (pit) + (Co/p2) exp (po1) 
“wee CHa) XD (Pat). (4) 


Eqns. (3) and (4) are of the same form, and similar types of 
expressions can be written for other known forms of impulsive 
excitation. Thus, in general, some measure of the output, 
say x(t), can be written as 


x(t) = A, exp (pyt) + Az exp (pot) +... + A,exp (ppt) (5) 


where the constant terms A; are related to the transfer function 
in a known manner through the input transform. 

Suppose now that from the output record a total of, say, 
(s + 1) > 2m measurements of x have been taken at equal time 
intervals T, at t = 0, x being xo, at t = T, x being x, etc., then 
from eqn. (5), 


Xo = 4y +42. +..1 +4, 1] 
x1 = AjZ, -F ALZ, +. + AnZ, 
Xp = AZ A AZ AcZe + 
nee = Ai Zk + AZ +, An Zs (k= 0,1, 2, « . 058) 
where Z, = exp (p;T), Z, = exp (p2T), etc. . (7) 


If the set of eqns. (6) are compatible, or self-consistent, then 
the constants A, to A, can be eliminated from the first (7 + 1) 
equations to give the determinant D,: 


Ben, CA 6.06 
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and generally, eliminating the A,’s from the (7 + 1) equations 
commencing with x;, 


pe was 
= Xj+1 Zi 
Dj+1 = 0 | : ees Mee 
pin Ziska 
xj 1 1 
ee Xj+1 Zi Z> 
Xjtn Li Zs awn. . e (8) 


In this way a total number of (s + 1 — n) determinants may 
be formed, and if the system under consideration were perfectly 
linear and free from noise, whilst the measured values were 
gubject to no errors, any ” consecutive determinants could be 
taken and used to form a characteristic equation for the roots Z. 
tn practice, however, noise and slight non-linearities do occur, 
measurements are not always perfect, and it is necessary to 


»roceed in the following manner. 
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Taking the (s + 1 — xn) determinants above, each one may 
be expanded in terms of the elements of its first column to give 
the following set of equations: 


0 = Boxo + B,x, + sane + Bae 
0= Box + ByX> se = Bx 


0 = Lepage + SX + cee =5 BAX 


pe oeen to) 


The constants, B, in eqn. (9) are functions of the required 
roots Z, but owing to the errors in x the equations are not con- 
sistent, and their number exceeds the number of unknowns. 
The best estimate of these unknowns can be found by solving 
eqns. (9) by the method of least squares, the resulting (7 + 1) 
normal equations being 


0 = BoXo0 + B, Xo + or ere) + B,Xon 
O = BoXjo + BXy, +... + BXin 


0 = BoXigo + Bi Xi, + - oO 
te— (OM eee) 


The coefficients of the unknown B’s may be found syste- 
matically from the following matrix product: 


+ BX; (10) 


Xoo Xo1 . ° Xon Xo Xy + 2 © Xs_n 
X10 X14 See Xin sas Xi XQ 2 t + Xs_ni1 x 
Xng AGI - a AG Xn Xnt1 + + + Xs 
Xo Xi - + + Xn 
X1 XQ... Xn+1 
x X2 X3- Xn+2 
Xs—nXs—nt+1+ + + Xs ° (11) 
The symmetry of this matrix should be noticed, X;; = Xj, 


(i ~ J), either as a check, or a means of avoiding arithmetic. 

Taking the first n equations in the set above, each one may be 
rewritten in determinant form, as eqn. (8), and then manipulated 
in the manner given in the Appendix, to give a characteristic 
equation, a polynomial of degree n, with real coefficients from 
which the required roots Z may be found: 


| Xoo Xo1 Xon—1) 
Z Xo Xy » Xin—-1) 

= |Z? Xo Xr - X2(n-1) (12) 
iS Xn Xnt Xn(n-1) 


Solving eqn. (12), the required exponents are found from 
eqn. (7). If the root Z is real 


p = (1/T) log. Z 


whilst if the root Z is complex, being one of a conjugate pair, 
say (« + jf), then 


p = (1/2T) log. («* + 6) + j(I/T) arc tan (B/a) 


Having calculated these exponents it is possible to return to 
eqns. (6), which are now a set of linear equations in the unknown 
A’s, which again may be solved by the method of least squares. 
Thus, knowing the form of the input, the overall transfer function 
may be found, 
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(3) APPLICATION OF METHOD 


The use of the above method, and some of its limitations, 
may best be shown by a fairly simple numerical example. 
Taking a system having an es transfer function 


YO = GET + oR FIMO 
subject to a unit step function 0,(p) = 1/p, the output transform is 


16 9p — 17 | 
BCP) = AG +)’ @+e 41 


andthe system output to this unit step input is 
0,(t) = 0:64 exp (—f) + exp (—1/4)(0-36 cos t — 0-77 sin f) 
this equation being graphed in Fig. 1. 
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Fig. 1.—Unit step response of system of transfer function 
P?/(p? + 4p + 10). 


To simulate an actual record, calculated values of response 
were rounded off to two decimal places and used as measured 
values. Using a time interval of T = 4sec over the first 6sec of 
the run gives values of x) = 1-00, x; = 0°34, x» = — 0-12, etc., 
at times t = 0, 0-5, 1-0, etc., it being taken that the steady-state 
output is the zero position. 

These values substituted in eqn. (11) give the normalized 
values of the displacement and the resulting characteristic 
eqn. (12) becomes 


1  1:6830 0-8076 0-071 2 | 
o = |Z. 078076 0-7406 —0-5156 
Z* 0:0712 0°5156 0:6650 
Z3 —0:4368 0:2224 0:5824 


Expanding this determinant, or solving otherwise, gives, 
Z, = 0°60482 and Z,, = 0-768 36 + j0-42044 
From eqn. (7) the exponents p are given by Z = exp (pT) or 
P, = 1:00576 and p,; = 0-26508 + j1-0012 


The actual values are pj = 1:00 and p,; = 0-25 + j1-00; 
it is seen that the major error occurs in the damping factor of the 
complex delay term. 

Returning to eqn. (6), 


xy = Ay exp ( — 1-005 76KT) 
+ exp (— 0-265 08kT)[A, cos (1-001 2kT) + A; sin (1-001 2kT)| 
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the resulting equations, solved by least-squares methods, giving 
A; = 0-60569, A, = 0°39760, A; = — 0°78750 


Thus, recalling that the input was a unit step function, the 
overall transfer function is 


y: 0-605 69 0-397 60p — 0-683 05 | 
MP) = Pl Cy 1-00576) + (p+ 0-26508)2 + (1-001 2)? 


1-003 29p? + 0-037 95p — 0-037 29 | 
lg + 1-005 76) (p* + 0:53016p + 1:07267) 


Comparing this result with the actual transfer function it will 
be seen that small terms containing p and p” have appeared in 
the numerator. This is a general feature of the method, and is 
of course a consequence of using inexact data. However, if 
the numerical values in the calculated transfer function are now 
rounded off bearing this point in mind, a good approximation is 
obtained: 


p? 


(p + 1-01) (p? + 0°53p + 1-07) 


It is most important not to carry out this rounding-off process 
until the final solution is obtained, or gross errors may be 
accumulated throughout the work. Again this is a feature of 
least-squares methods, and all possible figures should be retained 
in the calculations. 

If the initial data were taken with three-figure accuracy, then 
over the first 6 sec of the run the characteristic equation becomes 


Y(p) = 


1 1-:678025  0:°805218 0-071 604 
Ole 0-805218  0:738050 0:514198 
Z* 0:071604 0:514198  0-663385 
Z> —0-436414 0:220719  0-580344 


yielding values of the exponents 
P; = 0:9990 and pz 3 = 0-2513 +71-0000 


Here the agreement with the original data is very good indeed. 
In addition, a comparison of this characteristic equation with 
the previous one demonstrates that rounding-off may introduce 
errors in the final solution. 

Apart from the accuracy of the given data, the choice of 
time interval T will also influence the accuracy of the solution, 
but there is no simple rule governing this choice. Obviously a 
compromise must be made, since too large or too small a time 
interval can reduce the order of the characteristic equation by 
failing to resolve the effects of small or large delay terms. 


(4) CONCLUSIONS 


Measured values, at equally spaced intervals, of the initial 
response of a control system to a known input are used to form 
a characteristic equation from which the overall transfer function 
may be calculated. This characteristic equation, based on 
Prony’s method of exponential interpolation, is formed by a 
least-square method to obtain a best estimate from measurements 
subject to noise or errors. 
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(7) APPENDIX 
A form of the characteristic equation (12) has been derived 
previously by the authors,> and a similar method has been used 
by Vettin® for the integration of non-linear equations. 
Taking the first » normal equations (10), each one may be 
rewritten in determinant form as eqn. (8): 


1 


Xepeww ileaneet 
Di—0= EO) Vite 4) 
Seay ves 
bree phe 
T= 0 ae ee acer 2 etc. 
Xin | zt 7 Zz 


Now multiply determinant D, by &,, determinant D, by &, 
etc., the €’s remaining undefined, and then take the sum of these 


determinants to give 


(E;Xoo + 62X10 + +>) 1 | [Ph ee Des 
Qc (SiN exe een Zee - 


aye OZ Ze att 


(13) 


The values of € are now to be chosen so that, if possible, the 
elements of the first column of eqn. (13) each become an integral 
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power of some function or quantity A. For convenience, use is 
made of the symmetry of eqn. (11), Xj; = Xj; (i ~/); then 


E1Xo0 + £oX19 +... =A? 
E, X10 + &X4 + amen, = Al (14) 
GX. + Xm Fo = 


There are thus (m — 1) equations for the » unknown values 
of €, so for these equations to be consistent their eliminant must 
be zero, or 


1 Xoo = X01 . Xocn—1y | 
A XX . Xn | 

0 = d2 X59 X41 . Xx(n—1) | (15) 
nN Xn0 Xn s Xnn—1) 


Returning now to eqn. (13) and substituting from eqn. (14), 
1 1 1 


(16) 


Nie ot tees 


Eqn. (16) is now an alternant determinant and is a polynomial 
of degree n in A. This determinant may be readily evaluated, 
but it is sufficient merely to determine its factors. It is obvious 
that (A — Z,) is a factor, because if A = Z, two columns are 
equal and the determinant is zero. (A — Z>), (A — Z;), etc., 
are also seen to be factors, and in a similar manner so also are 
(Z, — Zp), (Z1 — 23), (Z2 — 23), ete. 

Now under the assumption that none of the exponents p are 
equal, none of the Z are equal, and thus eqn. (16) is only zero 
when the factors (A — Z;), (A — Z)), etc., are zero; thus there 
are n possible values of A, 

(17) 


These values of A may now be inserted in eqn. (15), which 
then becomes a polynomial of degree n in Z: 


NZ 


1 Xo Xn » Xon—1) 
Z Xo Xu » Xiq—1) 

O27 420 Xo - Xxn-1)| + (18) 
Zz” Xno Xn . Xnn—1) 


This is then the characteristic equation quoted earlier which 
yields the required exponents, and hence the overall transfer 
function. 
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SUMMARY 

In this Part the oxide cathode operating at 1020°K is shown as a 
2-element device—a thin-film cathodic-core thermionic emitter covered 
by a relatively massive porous oxide matrix. The thin-film emitter 
determines the total emission available from the cathode, and the 
porous matrix settles the potential rise experienced by the current 
before it emerges into the outer vacuum. This potential rise is shown 
to depend only on the self-generated electron density within the matrix 
pores and to be largely independent of emission density from the core 
emitter. 


LIST OF PRINCIPAL SYMBOLS 


V4 = Potential between cores, volts. 
I, = Electron current through matrix, mA. 
Ry = Matrix resistance, ohms. 
Ry = Stable low level of matrix resistance—the 
‘characteristic resistance’, ohms. 
Rmax = Resistance of cold matrix, ohms. 
R’ = Resistance of unit thickness of matrix, ohms. 
AV4 = P.D. due to current through cold matrix, volts. 
Viz1, Vix = Respective heater voltages, volts. 
Py, Py2 = Respective heater powers, mW. 
T,, Ty = Respective cathode temperatures, deg K, 
Vr}, Vr2 = Respective thermo-electric correction e.m.f.’s. 
AVr — Vry > Vr. 


(1) INTRODUCTION 


Neglecting the small contribution made by solid semi-conduc- 
tion to overall conductivity at 1000°K, it seems well established 
that the whole cathode current is thermionically emitted at the 
cathodic core surface and travels vacuum-wise through the oxide 
matrix overlay. The core surface is thus the essential emitting 
mechanism which determines the total or temperature-limited 
current that can be drawn through the matrix. The oxide 
particles of the overlay itself are, of course, themselves powerful 
emitters, but they contribute nothing to the main cathode 
current. The matrix might, in fact, be regarded as a framework 
supporting a distributed space-charge whose function is to 
determine the magnitude of potential fall suffered by the electron 
stream. 

It is proposed in the present Part to examine the properties 
and functional interdependence of the thin-film core emitter and 
the porous matrix overlay. It will be shown that the two com- 
ponents can be physically separated and will work in isolation, 
but that their interaction is necessary for long-term cathode 
operation. Some attention will also be given to the function of 
the electron-exit boundary of the matrix, ie. the area of oxide 


This paper is a continuation of Monographs Nos. 221 R and 243 R, published in 
February and June, 1957 (see 104 C, pp. 221 and 243), and Nos. 268 R and 269 R 
(see pages 183 and 189). 

Correspondence on Monographs is invited for consideration with a view to 
publication. 

Dr. Metson is at the Post Office Research Station. 


that can be seen by the anode in a conventional diode. It will 
be shown that the thermionic state of this surface is immaterial 
to the conductance of the diode. 

The experimental valves used in the work were the standard 
S-type assembly and the 6D15 diode described in previous 
Parts.* To avoid the complications of gas action, all valves 
were fitted with active-nickel cores and were well aged before use. 


(2) THE THIN-FILM CORE EMITTER 
(2.1) Probable Nature of the Emitter 


The mechanical structure of the porous matrix at the cathodic 
core face can be opened to direct visual examination by using the 
standard (BaSr)CO, spraying technique on a glass cover slip; 
the boundary is then available for viewing through the glass. 
A detailed microscope study of the boundary under suitable 
arrangements of illumination shows that only a proportion of 
the glass surface is in physical contact with particles—probably 
some 50% or more is left bare and uncoated. The carbonate 
particles on the metal core surface at a typical cathode are pre- 
sumably arranged in similar fashion. Thermal conversion of car- 
bonate to oxide results in a small overall shrinkage of the matrix, 
but the shape and relative juxtaposition of individual particles 
remain unchanged. The core distribution of oxide particles in 
a fully processed valve is therefore likely to follow the same 
pattern as the carbonate distribution seen through the glass 
cover slip, i.e. only a modest proportion of the core surface will 
be in physical contact with the matrix. During thermal pro- 
cessing, however, the bare areas of cathode core will be subjected 
to the condensation of barium and strontium oxide vapours 
arising from nearby (BaSr)O particles. The bare areas therefore 
become coated with a thin film of mixed alkaline earth oxides 
and the probable thickness of the film will be a small fraction 
ofamicron. The totality of these thin-film cathode core patches 
is regarded in the paper as the basic thermionic emitter in the 
common oxide-cathode system, i.e. the thin-film area determines 
the total emission available from the cathode at any particular 
temperature, in contradistinction to the massive porous overlay, 
which merely determines the potential fall between the core and 
the electron-exit boundary of the matrix. 


(2.2) Emission from Thin Film of Alkaline-Earth Oxides 


The thermionic-emission properties of thin films of alkaline- 
earth oxides on nickel have been thoroughly examined recently 
by Woods and Wright,t who investigated both single and mixed 
oxides and compared the total emission measurements with 


* The dimensions of the S-assembly and the 6D15 type diode are given here for 
comparison. The 6D15 diode consists of a 2-watt cathode, with a total coated area 
of 0-45cm2, mounted inside a grid-type anode. This consists of about 73 turns of 
28 u. molybdenum wire at about 175 turns/in. The matrix thickness is approximately 
60 uw and the distance between the grid plane and the cathode surface is about 110 UL. 
The S-assembly consists of two of the same 2-watt cathodes clamped together by 
tungsten springs to form a sandwich of matrix. The sprayed thickness is increased so 
iat fhe ro matrix is about 150 thick. Full details, with processing data are set out 
in Part 1. 

t Woops, J., and Wricut, D. A.: ‘Thermionic Emission from Thin Films o Bari 
and Strontium Oxide’, British Journal of Applied Physics, 1954, 5, p. 74. ‘ec 
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Table 1 


EMISSION FROM VARIOUS CATHODES 


Cathode Deposition Thickness ha canes ee 
; microns amp/cm2 
Evaporation : 0: 
Evaporation 0-10 ile 
Evaporation 1:00 0: 
Spray 100-0 0- 
Evaporation 0-10 0- 
Spray 100-0 0: 
SrO on BaO Evaporation Both 0-10 3-0 
(BaSr)O Spray 100-0 35) 
BaO on SrO Evaporation Both 0:10 0-6 | 


those from the conventional form of sprayed cathode. Some 
typical results, extracted from the paper, are shown in Table 1. 

The results are illuminating, and show that a film of mixed 
oxides a fraction of a micron thick can give the same total emis- 
sion as the conventional form of oxide cathode with a sprayed 
matrix thickness of 1004. The higher emission from the film 
of SrO on BaO compared with that of BaO on SrO is interesting 
and in line with previous experience on sprayed cathodes. 

In comment on these results the author would observe that, 
when comparing the thin evaporated film with the common thick 
spray, Woods and Wright are, in effect, comparing the same basic 
emitter and therefore obtain comparable emission levels. 


(2.3) Separation of the Core Emitter 


Fig. 1(a) shows a schematic of the two core emitters and the 
' matrix of a processed S-type assembly. The thin-film emitters 


_i 
. 


Fig. 1.—Schematic of core emitter and matrix in S-type assembly. 
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are regarded as adhering strongly to their respective core faces 
and it can be imagined that the withdrawal of a core, as in 
Fig. 1(b), would take with it the associated thin film. Such an 
operation would permit comparison of the two conductances of the 
diode using either the thin film or the normal spray as emitter. 
An attempt has been made to realize the schematic of Fig. I in 
a practical way. An active-nickel core was sprayed with mixed 
carbonates in the usual manner and mounted about 80, from 
a similar core which was bare and free from spray. The system 
‘was then processed to the usual S-type schedule and sealed from 
the pump after firing the getter. After the valve had been 
activated to minimum impedance in both directions, the cores 
were set at 1020°K and the two-directional voltage/current 


Va» VOLTS 


Fig. 2.—Comparison"of diode conductance using thin-film and 
normal sprayed cathodes. 


x xX x Sprayed cathode. 
OOO Thin-film cathode. 


characteristics were measured. The results are set out in Fig. 2 
and show that the device is electrically symmetrical. The 
following conclusions are drawn: 


(a) During thermal treatment the bare core B becomes coated with 
a thin activated film of mixed barium and strontium oxides. The 
emission from this film at 1020°K is adequate to meet the space- 
charge requirement of the diode in the direction B-A. 

(b) The impedances in the A-B and B-A directions of electron 
flow are identical. 

(c) The activated block of matrix influences the potential fall 
through the device but serves no other purpose. 

(d) The position of the matrix block relative to the core faces has 
no influence on the impedance of the device, e.g. if the block is 
suspended midway between the two core faces, the impedance will 
remain unchanged. 

(e) The core emitter and the matrix block are separate entities and 
can function in isolation and independence of each other. 


The thickness of the film on core B in the above experiment 
was estimated from the known rates of evaporation to be 
about 0:1 p. 


(3) POTENTIAL FALL IN THE MATRIX 
(3.1) Limits of Fall 


Both conventional and S-type assemblies were used in the 
experiments, and it will be convenient to express the potential 
fall between cathode and anode in terms of ohmic resistance in 
both cases. It was shown in Part 1 that there is a lower limit to 
the resistance of a standard S-type assembly at 1020°K, and 
this was described as the characteristic resistance, Rp. In the 
standard S-type valve Rp is about 14 ohms at 1020°K, and it is 
imagined that this limit is set by a space-charge condition arising 
within the hollow pores of the matrix as a result of thermionic 
emission from the individual matrix particles. If the emission 
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from a group of particles is suppressed in some way, ©€.g. a 
localized gas action, the matrix will experience a localized 
increase of resistance. If the thin-film core emitter itself remains 
functionally unchanged, a gradual suppression of the emission 
from all the particles of the matrix will result in a steady increase 
in resistance from Ry to some upper limit R,nax- The present 
Section considers the magnitude of the resistance of such an 
emission-limited matrix, 


(3.2) Resistance of Zero-Emission Matrix 


Fig. 3 shows a schematic of the electrodes in an experimental 
valve. The core pieces A and B, of active nickel, were fitted 


Fig. 3.—Schematic of electrode arrangements in an experimental 
valve. 
Dimensions are in microns, 


with internal heaters and were sprayed to a thickness of 120 p, 
but were separated by a vacuum gap of 260. The heaters 
were brought out of the glass envelope separately, so that indi- 
vidual control of core temperature was possible and could be 
measured using the thermocouples T; and T,. The whole system 
was vacuum processed in the standard manner for an S-type 
assembly. 

The diode was examined in the following manner: both cores 
were set accurately at 1020°K and the voltage/current charac- 
teristic of the diode wave measured with core A as cathode and 
core B as anode. The heater voltage in B was then reduced to 
zero and that of A readjusted to maintain its temperature at 
1020°K. A second characteristic was measured, and results are 
set out in Fig. 4. The voltage across the diode was finally 
reversed and the total emission from the cold B core (540° K) 
was shown to be less than a microampere. The following con- 
clusions are drawn: 


(a) The electron stream, Z4, from A passes readily through the 
matrix. covering of B in both hot and cold conditions. 

(6) At constant J4 the diode voltage, V4, is increased by AV 4 
when the B matrix is cooled from 1020° to 540° K. 

(c) The total emission from the cold matrix is essentially zero and 
the electron density within its pores is likewise presumed to be zero. 


If the resistance of the cold matrix is R,.qx, then 
AV, = if Ae Ras 


and a scrutiny of Fig. 4 will show that R,,,, is approximately 
100 ohms and independent of current. 

These results can now be applied to a standard S-valve whose 
matrix has dimensions similar to those of the matrix on the B core 
shown in Fig. 3. Provided that the cathodic core emitter remains 
fully operative, the effect of reducing the matrix temperature from 
1020 to 540°K (or, indeed, to room temperature) is only to 
increase the resistance from Rp = 15 ohms to R,»qx = 100 ohms. 
The S-valve with a cold matrix but a normally functioning core 
emitter may, in fact, be likened in behaviour to a conventional 
diode with a vacuum gap equal to the matrix thickness. If the 
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Fig. 4.—Voltage/current characteristic with hot and cold conditions | 


of matrix. 


cathodic core emitter is now progressively cooled the diode 
resistance will change but little from R,,.,, = 100 ohms until the 


core emitter approaches temperature limitation of emission, — 
whereafter it will increase abruptly and drastically. This hypo- — 


thetical picture of S-valve action will be put to test in the next 
Section. 


(3.3) Independence of S-Valve Resistance and Core-Emission 
Density 


Fig. 5(a) shows schematically a standard S-assembly. The two 
cores are of active nickel, and it is assumed that the system has 
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Fig. 5.—Resistance gradients in an S-type assembly. 
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been vacuum processed to a gas-free condition, ie. Ry = 15 ohms 
at 1020°K. Neglecting the influence of a small testing current 
on the system, there will be no temperature gradient across the 
matrix when the core heater voltages are equal. Suppose now 
that the cathodic core A is maintained at 1000°K and core B is 
cooled, by heater-voltage adjustment, to 800°K. The electron 
density due to particle emission within the matrix pores will now 
show a gradient through the matrix, and this, in turn, will give 
rise to an inverse gradient of resistance. Fig. 5(b) shows such a 
gradient, where R’ represents the resistance of a small unit 
thickness of matrix at any point between cores faces; the exact 
manner in which R’ varies with position is immaterial to the 
argument. Next suppose that A is cooled from 1000 to 800°K 
and B is heated from 800 to i1000°K. This operation will 
result in gradients of electron density and resistance across the 
matrix, and R’ is as shown in Fig. 5(c). Owing to the essential 
symmetry of the S-type system and the symmetry of the tem- 
perature juxtapositions, it will be clear that the two resistance 
gradients in Figs. 5(b) and 5(c) are exact mirror images of each 
other. In short, the two temperature arrangements leave the 
overall matrix resistance unchanged, as in Fig. 5(c), but involve 
a 200° swing in the temperature of the cathodic core emitter. 
If, as suggested by Woods and Wright, the work function of the 
thin-film core emitter is about 1:2eV, a 200° shift in core-emitter 
temperature should involve a change in its total emission by a 
factor of 100. There arises therefore the proposition that the 
S-assembly resistance is independent of core emission density— 
providing, of course, that the total emission is greater than any 
current used to measure to the resistance. 

Practical realization of the above condition is simple if certain 
experimental precautions are taken. Fig. 6 shows a standard 


D.C. POTENTIOMETER 


Fig. 6.—Voltage measurements in presence of temperature gradient. 


active-nickel-cored S-valve operating in the gas-free condition 
and carrying a small electron stream, [4. If the core tempera- 
tures T,; and T> are equal, the resistance of the device will be 
accurately determined by the relationship Ry = V4/I,, where 
V, is the straightforward voltmeter reading across the diode 
terminals. There are, however, concealed in the relationship 
two thermo-e.m.f.’s Vr; and Vy, arising at the core/matrix 
boundaries, opposing each other in direction and dependent for 
magnitude on the core-face temperatures. The more correct 
relationship for resistance is therefore written 


VENT; 


Ra = rs 


where AVr = Vry = Vr2 and AVr = (iter T, = T>. ; 

Since the object of the experiment is to vary the relation of the 
-wo core temperatures to induce image-wise resistance gradients, 
‘tis clear that AV must be accurately determined by preliminary 
measurement. Fig. 7 shows the variation of AV with anodic- 


377 
-400 
200 
= 
€ 
= 
q 
+200 
+400 
750 850 950 losO 1150 
ANODIC CORE TEMPERATURE To, DEG K 
Fig. 7.—Variation of AV7/T2 at constant T; (980°K). 
core temperature at constant cathodic-core temperature. The 


method of measurement is by straightforward d.c. potentiometry 
—the voltmeter V , in Fig. 6 is discarded and the potentiometer is 
set in its place across terminals 1 and 2. The current J, is 
reduced to zero by disconnection and AV; is measured over a 
range of T, from 800 to 1200°K with 7, constant at 980°K. 
The identical mirror-image characteristic is obtained by trans- 
posing the temperature conditions. 

The setting-up of the image-wise resistance gradients can be 
achieved in three slightly differing ways. Thus, if the core- 
heater powers are represented by P,,; and Py, core-heater 
voltages by Vzz; and Vz. and core temperatures by T, and 75, 
the following arrangements are possible: 


(a) With T; constant, measure Ry/T2; repeat for image with core 
temperature transposed. 

(6) With Vz; constant, measure Rg/Vz2; repeat for image with 
core-heater voltages transposed. 


The three techniques lead of course to very similar results, and 
method (a) will be used as an example. Four separate charac- 
teristic measurements are required, and these are scheduled as 
follows: 


(c) Set 14 =0. Measure AV7/T;, over range 1 150-950°K with 
T>2 = 1150°K constant. 

(d) Repeat for image with transposed core temperatures. 

(e) Set 14 = 20mA constant. Measure V4/T, over range 1 150- 
950° K with T> = 1150°K constant. 

(f) Repeat for image with transposed core temperatures. 


From these four results are derived the two required variations 
of R,/T, at constant T,, and R,/T> at constant T,, and these 
are set out in Fig. 8—a single curve being used to cover both 
characteristics. At every point on the curve the matrix has 
therefore two resistances of equal magnitude resulting from 
image temperature gradients but two different cathodic-core 
emitter temperatures. The resistance of the valve is therefore 
independent of the core-emitter temperature over a range of 
some 200°—corresponding to a hundredfold change in core 
emission. 

A second example using method (6) is shown in Fig. 9. The 
technique is exactly the same as for (a), except that Vy, and V;z> 
are transposed instead of T, and T>. The range of the measure- 
ment has been extended, and it will be apparent from the charac- 
teristic of variable cathodic-core heater voltage that the two curves 
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Fig. 8.—Image resistance under image temperature gradients: case A. 


x & & T; constant; T> variable. 
OOO Ty, variable; T2 constant. 
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Fig. 9.—Image resistance under image temperature gradients: case B. 


xX X X Vay constant; Vq2 variable. 
OOO Vai variable; Vz2 constant. 


diverge when Vz; falls to 3-5 volts. This effect is always 
observed and is probably due to reduction in the space-charge 
smoothing factor of the cathodic-core emitter. 


(3.4) Approach to Maximum-Resistance State 


It was concluded in Section 3.2 that the standard S-valve with 
an effective core emitter but with a zero-emission matrix should 
have a maximum resistance of about 100 ohms. The approach 
to this condition can be examined by applying a temperature 
gradient in such a way that the emission in the matrix adjacent 
to the anodic core approaches zero. A standard S-assembly is 
arranged with its cathodic-core emitter temperature, T,, constant 
at 980° K while the anodic-core temperature, T>, is varied between 
1150 and 650°K. The characteristic of R,/T, at constant Ty, 
derived in the usual way, is set out in Fig. 10. As 7> falls from 
1150°K, R, rises at increasing rate up to a maximum rate at 
890°K, where it remains constant down to 760°K. Further 
decrease of T, leads to a steady decrease in the rate of rise, and 
the general appearance of the curve suggests an asymptotic 
approach to a limiting resistance of 60 ohms. The bulk of 
this resistance must be in the cold half of the matrix, and it 
seems reasonable to suggest that the resistance of the whole 
matrix in a cold state is likely to be about 120 ohms. 


METSON: THE CONDUCTIVITY OF OXIDE CATHODES: 


Rg, OHMS 


650 7s0 850 950 
ANODIC CORE TEMPERATURE, DEG. K 


Fig. 10.—Showing approach to constant resistance with decrease of 
matrix temperature. 


T, = 980° K constant. 


(3.5) Positive Functions of the Matrix 


Experiment has shown that the oxide cathode is a combination 
of two distinct entities—the thin-film core emitter and the bulky 
porous overlay—each capable of acting in isolation from the 
other. The core emitter is clearly essential to valve action, but 
the overlay so far appears to do little but act as the equivalent of 
a power-absorbing series resistance. 
two very necessary purposes, namely maintenance and protection 
of the core emitter. 

The thin-film emitter operating at 1020°K is vulnerable to 
wastage by evaporation, and it is perhaps the principal function 
of the oxide overlay to replace this loss as fast as it occurs by 
counter-evaporation. The common oxide cathode is thus a 
dispenser system with its essential thin-film emitter constantly 
renewed from the massive reserves of the overlay. 

The second purpose of the overlay is to protect the film emitter 
from gas entering the cathode from the surrounding vacuum. 
To reach the core emitter the gas must work its way through the 
pore system, and it is reasonable to suppose that a high-density 
matrix will be more difficult to penetrate than a low-density one. 
Work by R. W. Lawson at Dollis Hill supports this view, and 
Fig. 11 shows the response to a common oxygen attack by two 
similar diodes occupying the same glass envelope—that with a 
matrix density of 0-61 showing much greater ability to withstand 
the attack than that with a density of 0-43. A further protective 
influence arises from the ability of the matrix to convert all gas 
within its pores to the negative ion state. The passage of a 
current through the matrix therefore tends to move gas away 
from the core emitter. 


(3.6) Role of the Electron-Exit Boundary of the Matrix 


The electron-exit boundary of the matrix appears to play no 
part in the operation of a conventional diode, apart from the 


In fact, the matrix serves — 
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Fig. 11.—Influence of matrix density on resistance of cathode to 
oxygen attack. 


Commion pressure = 2 x 10-SmmHg. 
Common cathode temperature = 1 150°K. 
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Fig. 12.—Schematic of experimental assembly for surface deposition 
of platinum. 


passive one of providing holes for the escape of electrons from 
the interior. Complete suppression of electron emission from 
all visible parts of an oxide cathode should therefore have no 
influence on the conductance of a conventional valve, and Fig. 12 
shows an experimental valve designed to test this assertion. An 
active-nickel-core cathode with the normal double-oxide matrix 
was set up as a diode with an open grid-type electron collector. 
After the usual vacuum processing, a voltage/current charac- 
teristic was measured with the cathode core set to 985° K with 
the aid of the thermocouple T. Two fine platinum wires A and 
B were then heated electrically to such a temperature and for 
such a time as to evaporate a grey film of platinum metal on all 
visible parts of the oxide cathode surface. The platinum wires 
were then cooled and the oxide cathode was given a brief 
reactivation treatment to recover it from the gas attack inseparable 
from the act of bringing the platinum wires close to their melting 
point. The cathode core was finally set again at 985° K with the 
aid of the thermocouple, and the voltage/current characteristic 
was remeasured. The diode conductances before and after 
platinum deposition are set out in Fig. 13 and seem to be identical. 
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Fig. 13.—Voltage/current characteristic before and after surface 
deposition of platinum. 


Common core temperature = 985°K. 
xX X X Before. 
OOO After. 


It is assumed that thermionic emission from the surface deposit 
of platinum at 985°K is negligible. The appearance of the 
cathode surface before and after deposition is shown in Fig. 14. 


Fig. 14.—Appearance of cathode surface after deposition of 
platinum. 


The outer cathodes are included for comparison, and show cathodes processed 
normally. 


(4) CONCLUSIONS 


The present Part may now be summarized in the form of the 
following conclusions, which are drawn up on the assumption 
that the whole of the diode current passes vacuum-wise through 
the hollow pores of the matrix in the temperature range 
900-1 100° K. 


(a) The common oxide cathode has two distinct functional com- 
ponents—the thin-film core emitter and the relatively massive porous 
matrix overlay. The two components can be physically separated 
and shown to act independently and in isolation. 
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(b) The core emitter determines the total or temperature-limited 
emission from the cathode. 

(c) The matrix overlay determines the voltage rise between core 
surface and electron-exit-boundary. 

(ad) The voltage gradient through the matrix is determined only 
by the electron density arising within the matrix as a result of pore- 
wall thermionic emission. The gradient is largely independent of the 
level of total emission from the core emitter, provided that this is 
sensibly larger than the current traversing the matrix. 

(e) The electron-exit boundary of the matrix plays no active part 
in the phenomenon of valve conductance, apart from keeping 
thermal emissivity as low as convenient. 
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The 5% element of the current which is known to travel the 
matrix by way of a solid semi-conduction path will, of course, 
modify these conclusions to some small extent. 
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SUMMARY 


The paper is concerned with the development of a modulator 
capable of giving much lower stable levels of carrier leak than previously 
obtained. After consideration of various types of modulator, the 
conclusion is reached that the constant-current modulator, originally 
proposed by Cooper, is probably capable of giving the highest stable 
level of carrier rejection. This modulator consists, essentially, of a 
ring modulator whose input transformer is, ideally, replaced by 
infinite-impedance valve generators feeding each rectifier input 
terminal. 

The action of the constant-current modulator is analysed in detail 
on the basis of small differences in the characteristics of the rectifying 
elements, and associated topics are discussed. Experimental evidence, 
obtained using a carrier frequency of 3kc/s, is given which, to a con- 
siderable degree, supports the theoretical analysis. 

The modulator, thus developed, is capable of giving carrier leak 
levels more than 90dB below the carrier current from each source 
which are stable with temperature and time. Also, the action of the 
modulator is such that the fundamental component of the carrier 
leak is not affected by relatively large unbalances in the carrier currents 
from each source. 


(1) INTRODUCTION 


The action of any amplitude-modulating system is essentially 
to produce outputs of additive combinations of carrier and signal 
frequencies and their harmonics. Thus, if the carrier and signal 
frequencies are denoted by f, and f, respectively, the output is 
of the form X(nf, -+ mf,), where n and m are zero or integers. 
In many cases, however, only the sidebands, f, + f; are required, 
so that all other output components must be removed. The 
carrier-frequency harmonics and their associated sidebands are 
easily removed by simple filters, leaving only the output of 
carrier frequency to be removed. The complexity of filters for 
this purpose rapidly increases when the signal extends to low 
frequencies, owing to the difficulty of separating f, and f, + of. 
Thus a modulator circuit which inherently rejects the output of 
carrier frequency is desirable. Such a device is called a balanced 
modulator. 

All balanced-modulator circuits depend for their action on 
the exact similarity of at least two modulating elements, and 
since these must be non-linear, such an exact similarity is 
impossible to achieve in practice so that there is some residual 
output of carrier frequency. This is termed carrier leak. 

Many different rectifier-modulator circuits have been proposed 
and used for obtaining low levels of carrier leak. Most of these 
are elaborations of the basic series, shunt and ring modulators, 
mainly the latter. Each of these basic modulators produces 
approximately the same carrier-leak voltage under given con- 
ditions of carrier voltage, rectifier characteristics, etc.,! but the 
ring modulator, having the highest efficiency, is capable of giving 
a higher ratio of wanted sideband output to carrier leak. 

The basic circuit of the ring modulator is shown in Fig. 1. 
A simple description of the action of the circuit is obtained by 
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Fig. 1.—Basic ring-modulator circuit. 


assuming that the rectifiers have a finite, constant forward resis- 
tance and an infinite reverse resistance, and that the carrier vol- 
tage is large compared with that of the signal so that the condition 
of the rectifying elements is determined solely by the polarity of 
the carrier. Thus on alternate carrier half-cycles either rectifiers 
1 and 3 or 2 and 4 are conducting and the signal in the output 
transformer alternates in polarity at the carrier frequency. The 
signal is thus, ideally, multiplied by a perfect square wave, or, 
more precisely, the modulating function is a square wave. If the 
circuit resistances in the two possible carrier paths (there will 
always be two rectifiers with infinite resistance) are equal, the 
carrier currents in each half of the output transformer will be 
equal and in antiphase. Thus there will be no output of carrier 
frequency, and the modulator is therefore balanced. 

Owing to the non-ideal characteristics of practical modulating 
elements, the shape of the modulating function in an actual 
modulator lies between a square wave and a sine wave (or the 
waveform of the carrier, if the latter is non-sinusoidal), and, as 
the characteristics of rectifiers vary from sample to sample, the 
basic circuit is incapable of giving complete balance to the 
carrier. 

The carrier leak is mainly dependent on the difference in 
forward resistance of the rectifiers; the only requirement for the 
reverse resistance is that it should be sufficiently high for its 
effect to be neglected. If rectifiers are used which have a ratio x 
between maximum and minimum forward resistance, and if the 
carrier source resistance is large compared with the rectifier 
forward resistance, it can be shown that the maximum funda- 
mental component of carrier leak to be expected is given by 


Va % =Vfx — 1) ee ta ee aD 


where V, is the carrier voltage appearing across the rectifiers.! 
For balanced-modulator applications, rectifiers may be obtained 
which are selected so that their forward resistances lie within a 
range of +5%, making x equal to 1-1. Using such rectifiers, 
therefore, it is possible to obtain a fundamental component of 
leak equal to 0-064 V,; ic. the fundamental leak is approxi- 
mately 24dB below V,. 

In general, the output which is of interest is that of one first- 
order sideband, and allowing for a normal overall conversion 
loss, this output would be approximately 6dB below the input 
signal level, which must itself be considerably below the level of 
V.. owing to the requirements of linearity. Thus, in such a case, 
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the carrier leak would be less than 18 dB below the level of the 
sideband output. : 

There are, however, ways of improving this balance; for 
example, it is common practice, having sorted rectifiers into 
groups with +5% forward resistance, to select further from 
these groups, by means of trial-and-error substitution, sets of 
four rectifiers giving low carrier leak. The leak may be further 
reduced by the interposition of variable voltage-dividers in the 
ring.2 These voltage dividers can be adjusted to balance the 
rectifier forward resistances, thus effectively making x closer to 
unity and hence reducing the carrier leak. The carrier leak 
cannot be completely eliminated as the forward resistances can 
only be balanced at one point in the cycle. 

With such processes, it is possible to obtain a fundamental 
component of carrier leak which is of the order of 60dB below 
the level of V... However, the dependence of the leak on varia- 
tions in the rectifier characteristics remains unchanged; ie. if 
these variations are such as to cause the rectifiers to have a ratio 
of maximum to minimum forward resistance of x, eqn. (1) will 
apply. Considering only temperature effects, it is quite possible 
for the difference in the temperature coefficients of the four 
rectifiers to be 1% per deg C, so that a change in temperature of 
1° C would suffice to worsen the leak from 60dB to 45 dB below 
the level of V,. It is clear from this example that the stability of 
highly balanced ring modulators is indeed doubtful. The many 
variations of the ring modulator which have been developed to 
reduce the carrier leak all suffer in a similar manner from the 
effects of variation of forward resistance, and although higher 
degrees of balance have been obtained, they are only suitable 
for applications requiring short-term stability. 

To obtain a modulator which is capable of a very high degree 
of carrier balance, with good long-term stability, a method must 
be used which eliminates, as far as possible, the first-order 
dependence of the leak upon these variations in forward resistance. 
A method proposed by Cooper? of overcoming this dependence 
is capable of achieving a large measure of success, and the 
present paper is a report of a detailed study of it. The circuit 
is reproduced here for convenience and is shown in Fig. 2. It 
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Fig. 2.—Cooper’s constant-current modulator circuit. 


consists, essentially, of a ring modulator whose input transformer 
is replaced, ideally, by infinite-impedance valve circuits feeding 
each rectifier input terminal. Thus, throughout the major part 
of the cycle, the carrier currents in each half of the primary 
winding of the output transformer are independently controlled 
and should therefore be capable of balance by adjustment of the 
input currents. In the region of the carrier zeros, all four 
rectifiers become of comparable resistance, and accordingly the 
constant-current feature is lost for a small fraction of the cycle. 
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A theoretical analysis of the constant-current modulator is 
given in the following Section, and experimental results are dis- 
cussed in Section 3. The latter were obtained at a frequency of 
3kc/s, so that the effects of parasitic capacitances (which are not 
discussed in the theory) were very small. The capacitances 
between the output terminals of the drives and earth, owing to 
their effect on the output impedance of the drives, are more 
important than the rectifier internal capacitances. It appears 
that the operation of the circuit would not be greatly affected 
below frequencies of the order of hundreds of kilocycles per 
second, 


(2) THEORETICAL ANALYSIS 


(2.1) Leak at Carrier Peaks 


This theoretical analysis is obtained on a basis of small 
differences in the characteristics of the four rectifiers. 

An expression for the instantaneous carrier leak of a constant- 
current modulator can be obtained by assuming a completely 
different characteristic for each of the modulating elements. 
This expression is, however, too complex to have any real value 
and the case will therefore be considered where three of the 
rectifiers have exactly similar characteristics and the fourth has 
a slightly different one. By using the results obtained, the leak 
waveform in the worst possible case for rectifiers whose charac- 
teristics lie within given limits may be evaluated, as will be 
shown later. 

Three main assumptions are made, namely: 

(a) The impedance of the carrier generators is infinite. 


(6) The currents from each generator are exactly equal in mag- 
nitude and phase. 

(c) The primary winding of the transformer may be regarded as 
a short-circuit to the carrier currents. * 


If rectifier 1 of Fig. 3 is the one with the odd characteristic, 
the circuit at any instant can be represented as in Fig. 4. The 
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Fig. 3.—Equivalent circuit of constant-current modulator. 
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Fig. 4.—Equivalent circuit of constant-current modulator using one 
rectifier with characteristics different from those of the others. 


instantaneous unbalance current in the output transformer is 
given by 


= @ —i) +G — iy 
and subsequently, defining the unbalance current ratio, u, as 


* The actual impedance seen is dependent upon the ratio of the carrier curr i 
each half of the primary winding, and if this ratio is not greatly different four ney 
the assumption of a short-circuit is justifiable. The ratio will be furthest from unity 
at the carrier zeros, but here the rectifier resistances are large so that any departure 
from a short-circuit would have negligible effect. A more detailed treatment of the 
Output transformer is given in Appendix 2 of Reference 6. ; 
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the ratio i,/i, where i is the instantaneous carrier current, it can 
be shown that 

2Ri(R, — Ra) 
(Ra + Ry(Rz + Ry) 


The unbalance current ratios at the peak values of the carrier 


current can be expressed in terms of the rectifier forward and 
reverse resistances, ry and ry. 


(2) 


Carrier peak when rectifier 1 is in its forward condition. 
For this, vy is substituted for R,, rf, for Ry, and r, for R,. 
2rp(rp — re) 

(eer rr ip) 


As r, is normally very much greater than We, 


Hence Up = 


(3) 


2 = 
up 7 — Thi) 
lp ; 
Or, if there is a small difference Are between re and rpy, the 
expression becomes 


r 
ug ~ 2Ay (4) 


Carrier peak when rectifier 1 is in its reverse condition. 


Substituting r, for R,, rz; for Rj, and ry for Ry, and using 
App as the difference between ry, and rp, 
Up ™ 2A,£ (5) 
lp 
The results for up and uz illustrate a very important principle, 
namely that the matching of the reverse resistances of the rectifiers 
is just as important as that of the forward resistances. In practice, 
A, is likely to be much greater than A, and will usually be the 
more important cause of carrier leak. This result, being opposite 
to that for the ring modulator, is one of the effects of constant- 
current operation. Furthermore, the results contain a factor 
r¢[r, which is absent in the ring-modulator results. As this ratio 
is normally of the order of 4 x 10~4, the advantages of the 
constant-current modulator are already apparent. It should be 
noted that A; is synonymous with the factor (x — 1) appearing 
in eqn. (1). 
(2.2) Leak near Carrier Zeros 


In an experimental constant-current modulator described by 
Tucker,! the output waveform contained narrow peaks of 
relatively large amplitude in the region of the carrier zeros. 
These peaks were attributed to the loss of constant-current 
operation in these regions due to all four rectifiers assuming the 
same order of resistance, as explained by Cooper.* The peaks 
were also thought to contain the major portion of the leak output 
energy. This explanation is not entirely satisfactory, as the 
carrier current in the region of the peaks is very small, so that 
to produce such peaks the unbalance current ratio would have 
to be very large. 

An approximate idea of the behaviour near the carrier zeros 
can be obtained by assuming the rectifiers to obey an exponential 
law of resistance versus voltage. It has been suggested that, at 
and near cross-over, the rectifier resistance may be very closely 


represented by i 


r=rTYr 0€ | 
This law is discussed fully in Reference 4. One of the useful 
features of the law is that, for most rectifiers of the same type, 
the index b tends to be constant and differences in rectifier 


characteristics are caused by variations in ro. 
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In order to obtain a theoretical result for the leak, it is assumed 
that the voltages across all four rectifiers are equal. This is not 
entirely compatible with the idea of constant-current operation, 
but if the assumption is not made, V becomes a function of 
rectifier resistance and current and the expressions become 
unmanageable. It will be appreciated that the assumption 
becomes more accurate towards the carrier peak when the odd 
rectifier is in its reverse condition, and least accurate towards 
the other carrier peak. 

Substituting roe—°” for R,, rge—” for RZ, and roe” for Rp 
in eqn. (2), and defining Aprp as ro — rg, then 


= Ao 
4 cosh b V(2 cosh bV — Age") 


The unbalance current is given by the product of u and i, the 
instantaneous carrier current. The circuit in each constant- 
current output consists of two rectifiers back-to-back, whose 
parallel resistance is given by rp/(2 cosh bV), neglecting the 
effect of Ap in this case. Hence the current from each source is 
approximately given by 


(6) 


; — 2V cosh bV (7) 
ro 
The instantaneous unbalance current is therefore 
2AoV 
2 (8) 


4 ro(2 cosh bV — Age”) 


Assuming that eqn. (8) cannot be solved for V analytically, 
the method of obtaining the variation of i, with i consists in 
calculating the variation of i, with V from eqn. (8) and, using 
eqn. (7), evaluating i for each value of V. Fig. 5 shows the 
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Fig. 5.—Leak waveform near the carrier zero. 


Calculated from the theoretical law r = roe—6V, for the case of one unbalanced 
rectifier and sinusoidal carrier of 5mA peak. 
b = 16; ro = 10 kilohms. 


variation of i, with degrees of cycle for a peak carrier current 
of 5mA and with typical values of 5 and rp as 16 and 10 kilohms, 
respectively. It is apparent from these curves that there is a 
maximum value of unbalance current which occurs at about 
0-3° of the carrier cycle. As Ao is increased further, i.e. above 
0-5, there is a tendency for the maximum to occur later in the 
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cycle, but over the range considered the position of the maximum 
is practically independent of Ap. Also, the term Aye~*” in 
eqn. (8) is only important at angles below 0-1° for the positive 
half-cycle, so long as Ag is less than 0:5; hence, for positive 
carrier, 

rg cosh bV © 


(9) 


i, ~ 


On the negative carrier half-cycle, the term Aye~°” becomes 
Ave”, and as cosh (—bV) = cosh bY, from eqn. (8), 
2No 


[en (10) 
ue ro(2 cosh bV — Age’”) 


Over the range considered, Aye~°” is small, so that Ape?” is 
effectively equal to 2A, cosh bV, and hence, from eqn. (10), for 
negative carrier, 

Pre AoV 

1u =~ rod — Ap) cosh bV 


(11) 


By differentiating eqns. (9) and (11), the maximum values of i, 
and their positions in the cycle may be obtained. For positive 
carrier, the peak value of unbalance current is 0-663A9/bro amp 
and this occurs at a carrier current of 4-35/brg amp; for negative 
carrier, the peak value of unbalance current is 0: 663A9/bro(1 —Ao) 
and the position is as for positive carrier. 

The magnitudes of these peaks of unbalance current and the 
carrier current at which they occur are seen to be independent of 
the peak value of carrier current. This is to be expected, since 
the effect of increasing carrier current should be to contract, in 
time, those parts of the leak waveform near the carrier zero and 
to insert new parts in the region of the carrier peaks. 

The results obtained from this exponential analysis can be 
compared with those obtained for the unbalance currents at the 
carrier peaks by considering typical values for the various 
parameters. Typical parameters for germanium rectifiers are: 
rg = 100 ohms, ro = 10 kilohms, r, = 300 kilohms, and b = 16. 
Assuming that Ay = A, = Ay = A, and that the carrier current 
is 5mA peak (a normal value), the following results are obtained: 
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Unbalance current at carrier peaks. 
Peak unbalance current near carrier zeros 


As a check on these results, and to investigate the intermediate 
region between carrier zero and carrier peak, the leak current 
has been calculated using eqn. (3) and taking the values of gs 
and r, from actual rectifier characteristics. The characteristics 
used are (a) the geometric mean between the 67°%% sample 
limits of a batch of germanium rectifiers, and (b) the median 
of the distribution curves of a batch of 50. The resis- 
tances at +1 volt are respectively (a) 100 ohms, 100 kilohms, 
and (5) 180 ohms, 220 kilohms. The unbalance has been 
achieved, in each case, by using three rectifiers with these charac- 
teristics, and subtracting 10% of the resistance at each voltage 
to obtain the characteristic of the fourth rectifier, thus obtaining 
a constant value of A of 0-1. Compared with the exponential 
law, this is equivalent to subtracting 10% from the value of ro. 
The calculated variations of unbalance current with carrier 
current are shown in Fig. 6, which also includes, for comparison, 
the relevant portion of the variation near carrier zero, calculated 
using the values of b obtained from these characteristics, in the 
exponential law. 

The absence of any definite peaks near the carrier zeros in the 
waveforms obtained from actual rectifier characteristics is 
probably due to the variation in the index b. This diminishes 
rapidly in the reverse direction and has a tendency to rise, 
initially, in the forward direction. Fig. 7, which shows the 
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Fig. 6.—Variation of instantaneous unbalance current for the case of 
three similar rectifiers and one differing by a constant fraction. 
The leak near cross-over, calculated using r = roe—V is partially shown. (a) and 
(b) refer to characteristics given in Section 2,2. 
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Fig. 7.—Variation of unbalance current over the carrier cycle for a 
sinusoidal carrier current of 5mA (r.m.s.). 


leak waveform for a carrier current of 5mA r.m.s., calculated — 


from Fig. 6, indicates that the carrier leak is of fairly constant 
magnitude throughout the half-cycle. 

This analysis of the leak in the region of the carrier zeros 
indicates that, although peaks in the unbalance current in this 
region may exist, their magnitude is insufficient to explain the 
type of waveform obtained in practice. 


(2.3) Effect of Unbalance in Carrier Currents 


Cooper? proposed that it should be possible to balance the 
modulator by adjusting the magnitude of the current from one 
driving circuit. Experimental evidence has shown, however, 
that, although the even harmonics of the carrier leak are altered 


by such an adjustment, the magnitudes of the fundamental and © 


odd harmonics remain substantially unchanged even with 
relatively large unbalances—of the order of 10°%—of the carrier 
currents. An approximate idea of this action may be obtained 
as follows. 

Consider, for example, that the resistance of the dissimilar 
rectifier is low; then, if the carrier currents are equal, too much 
current will flow through this rectifier and too little through the 
normal rectifier which is effectively in parallel with it and biased 
in the opposite direction. If, now, the carrier current from one 
source is reduced to counteract the excess of current in the 
dissimilar rectifier, the current flowing through the parallel 
rectifier will become still less than normal. Thus, there would 
be an alteration in leak waveform such that the instantaneous 
leak current would be reduced at one carrier peak and increased 
at the opposite one. 

A more precise analysis is obtained by reconsidering the 
circuit of Fig. 4 with different carrier currents, i, and i, such 


HANDS: A RECTIFIER MODULATOR 


that i, — i, = di,. By similar methods of analysis, an equation 

for the unbalance current ratio is obtained, namely 

_ RaRy — RR) — 8) + (RZ — R,RS 
(Ra + Ry)(Ra + Rp) 

Considering the carrier peak when rectifier 1 is in its forward 


condition, using the same notation as before and assuming that 
rf < lb, 


u 


(12) 


ig 
ux Apr —§) +6. (13) 
Considering the opposite carrier peak, the expression for wu is 
found to be 


(14) 


Thus, if 5 is small so that (2 — 8) is approximately 2, altering § 
to reduce wu in eqn. (13) will increase uw by an equal amount in 
eqn. (14), and vice versa. Therefore, whilst it is possible to 
reduce the magnitude of the leak at one carrier peak, its mag- 
nitude at the other peak is increased by a similar amount, and 
the amplitude between these points on the leak waveform 
remains unchanged. This statement will be true to an extent 
which depends on the validity of the assumption that ip <rp 
at parts of the cycle away from the carrier peaks, so that a wave- 
form is obtained which is quite closely specified by the 
relationship 

i{0) — i8 + 7) = C6) C15) 


where C(6) is a function of 6, but is constant at a given value of 
0, regardless of the value of carrier current. By the use of a 
Fourier technique it is possible to show (a) that the magnitude 


of the even harmonics does not affect the condition of eqn. (15), 


and (6) that, if the magnitudes of the odd harmonics remain 
unchanged, the condition of eqn. (15) is satisfied. However, to 
prove the converse of these results, a further condition specifying 
the waveform seems to be required. In any event, it is clear 
from these results that the odd harmonics, if altering at all, will 
alter much less than the even harmonics. 


(2.4) Maximum Leak to be expected in General Case 


This Section deals with the case where the characteristics of 
the four rectifiers in the constant-current modulator lie within 
a given range. The maximum leak will be obtained with the 
appropriate combination of rectifiers whose characteristics lie 
on one or other of the extremes of this range. One such com- 
bination is found to be when rectifiers 1 and 2 (Fig. 3) lie on the 
high extreme, and rectifiers 3 and 4 lie on the low extreme, in 
both forward and reverse conditions. 

For most of the rectifier data studied, it is apparent that the 
fractional difference between the upper and lower limits of the 
rectifier characteristics assumes two reasonably constant values, 
A; and A,. The use of this assumption enables a simple expres- 
sion to be obtained for the maximum expected leak. 

Considering the carrier half-cycle when rectifiers 1 and 3 are 
in their forward condition, those rectifiers which lie on the 
lower limit, namely 3 and 4, are fed from different sources but 
drive the same end of the output transformer. . Thus, if rectifiers 
i and 2, whose characteristics lie on the upper limit, are con- 
sidered to be normal, each low-resistance rectifier, ie. 3 and 4, 
ean be considered to be a case of one unbalanced rectifier, and 
che resulting leak will be the sum of two such cases, given by 


a= 2L(Ay + A,) 
a) 


On the opposite half-cycle, the leak will be given by the same 
expression but of opposite sign. 
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Neglecting, temporarily, any effects near the carrier zeros, it 
is clear that the leak waveform will lie within a square wave of 
peak-to-peak amplitude given by 


+ f (Ay a6 Ane, 
lp 


where i,,., is the peak value of carrier current. Thus the maximum 
t.m.s. value of the fundamental component of carrier leak using 
such rectifiers would be given by 


8 rf 
7 Vp 


(Ay + A,)I (16) 


where J is the r.m.s. carrier current from each source. 


(2.5) Evaluation of Ay and A, from Rectifier Characteristics 


If the regions near the carrier zeros are neglected, the rectifier 
configuration in the constant-current modulator can, to a first 
approximation, be divided, as far as currents and voltages internal 
to the configuration are concerned, into two pairs of parallel 
back-to-back rectifiers. The voltage across such a pair of 
rectifiers is determined by their parallel resistance, which will, 
over most of the carrier cycle, be quite closely equal to the 
resistance of the rectifier which is in its forward condition. 
Thus, if the resistance of the latter rectifier is known for any 
current, the voltage across the rectifier which is in its reverse 
condition will be known for that current, so that the charac- 
teristics required are forward-resistance/current and reverse- 
resistance/voltage. The value of Ay can then be taken as the 
fractional difference of the forward-resistance/current charac- 
teristic. The value of A,, however, is not so easy to determine, 
as neither the voltages nor the currents are the same for the 
two rectifiers which are in the reverse condition. It is clear 
that the voltages across them are more nearly equal than the 
currents through them, and as the resistance/voltage charac- 
teristic of a rectifier usually tends to flatten off in the reverse 
direction, it should be sufficiently accurate to take A, as the 
fractional difference of this characteristic. 

Rectifier forward characteristics are rarely given in the form 
of resistance/current, but are often given as resistance/voltage, 
so that it is of interest to know the relationship between the 
fractional tolerances of these characteristics. The evaluation of 
this relationship is facilitated by using the law R = AI~7/3, 
where R is the rectifier resistance, A is a constant for a given 
rectifier, and J is the current through the rectifier. This law is 
found to hold quite accurately for forward currents greater than 
104A, but it cannot be modified or extended to cover the reverse 
range. Differences in rectifier characteristics are usually due to 
differences in the constant A, rather than any change in the index. 
It follows from this law that the forward-resistance/voltage 
relationship is R = A3V~2, 

Consider two rectifiers obeying these laws, one a normal 
rectifier and the other low in resistance owing to a lower value 
of the constant, and assume this new constant is (1 — 6)A. It 
is evident that, if 5 is small so that (1 — 5)? is approximately 
(1 — 36), the difference in the resistance of the two rectifiers, 
measured at the same voltage, will be approximately three times 
that measured at the same current. Thus A, will be of the 
order of one-third of the fractional difference taken from the 
forward-resistance/voltage characteristic. This is a separate ~ 
advantage of using constant-current operation; not only does 
the constant-current modulator give a lower level of carrier leak 
than a ring modulator using the same rectifiers, but also the 
forward fractional difference of the rectifiers is effectively 
improved. 

The resistance/voltage characteristics of rectifiers usually show 
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a much larger fractional difference for the reverse condition than 
for the forward condition. This effect will be enhanced by the 
use of resistance/current characteristics, so that, for the constant- 
current modulator, it should normally be possible to neglect Ay 
in comparison with A, when computing the leak. 

A further result arising from the use of this power law is that 
the voltage across the rectifiers, over the major part of the cycle, 
will be given by 

V = Ai'3 


where i is the instantaneous carrier current. Thus this voltage 
is considerably more square in waveform than that of the carrier 
current. This result, coupled with the flattening of the resis- 
tance/voltage characteristic, increases the justification of obtaining 
A, from this characteristic. 


(2.6) Driving-Circuit Impedance Requirements 


In order to assess the impedance requirements of the driving 
circuits, it is necessary to consider how the balance is affected 
by the values of these impedances. The circuit when the 
impedance of the driving circuits is not infinite is shown in 
Fig. 8. As before, the case is considered where rectifier number | 


LOAD 


Fig. 8.—Modulator with separate finite-impedance sources. 


has a dissimilar characteristic. The voltages applied to each 
driving circuit are assumed to be equal, and the transformer 
is considered to act as a short-circuit. The unbalance current is 
found to be 

; R,(Rq — Ra)(Rp + 2R) V 

* [RRs + RRa + Rp) RoR, + RRA R;)] 


If the unbalance is small and if the driving-circuit impedance 
is large compared with the resistance of two back-to-back 
rectifiers, the currents from each source are approximately given 
by the relationship 


(17) 


Using this expression for V, an expression for u can be obtained: 


thr RyRy nd Ry) (Ry =e 2R) 
i [RGRp + RR, + Rp) [Ra + Ro] 


If the driving-circuit impedance is very large so that R > R,, 
eqn. (18) reduces to eqn. (2), where the driving-circuit impedances 
were considered infinite. Clearly, the requirement R > R, will 
be most difficult to meet when R, represents a reverse resistance. 
Taking this case and assuming that r, > rp and R> rr, 1S 
given by 


u= 


(18) 


tp Th + 2R 


ws Ar 5 (19) 


The minimum value of uw is clearly obtained when R is infinite. 
A reasonable compromise might be to have R = r,, when the 
leak would be 50% above its minimum value. 


(3) EXPERIMENTAL RESULTS 
(3.1) Modulator Circuit 


A modulator constructed according to Cooper’s circuit, using 
pentodes type EF 91, was found to be unsatisfactory owing to 
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non-linearity in the driving circuits. When operated without 
rectifiers, i.e. with the driving circuits connected direct to the 
ends of the primary winding of the output transformer, and with 
a 600-ohm output load, a residual output of irregular waveshape, 
which, although capable of reduction to a level of 90dB below 
the carrier level by resistive and capacitive balancing of the 
driving circuits, was not stable to 75 dB for more than 30min. 
As a final carrier-leak level, when using rectifiers, of better than 
90 dB below the carrier level was theoretically indicated, such a 
circuit was of little use. The use of triodes, instead of pentodes, 
was found to eliminate these non-linear effects, but unfortunately 
the use of a single triode in each driving circuit is insufficient to 
provide the required output impedance as discussed in Sec- 
tion 2.6. The final circuit developed, which is shown in Fig. 9, 


TO SECOND DRIVE CIRCUIT 


Fig. 9.—Modulator using two-valve drives. 


consists of 2-stage driving circuits using triodes in the output | 


stage. Feedback is taken from the cathode resistor of the 
output valve, via a resistive mixing circuit with phase adjustment, 
to the control grid of the input.pentode. This circuit gave very 
satisfactory results and was used to obtain the experimental 
results given in this Section. The output impedance, calculated 
theoretically, was 28 megohms. 


(3.2) Leak near Cross-over 


The leak waveform initially obtained was similar to that 
described by Tucker,! in that large peaks occurred in the region 
of the carrier zeros. As such peaks are not indicated by the 
theoretical treatment, it is apparent that they are either the 
result of some reactive effect in the circuit or possibly due to 
minority-carrier storage in the rectifiers when switched from 
forward to reverse conditions. The latter effects are readily 
observed when the type of rectifier used is subjected to a 2-volt 
peak-to-peak square wave (rise time less than 30 millimicrosec), 
but are not apparent with sinusoids of frequency up to 250kc/s. 
Owing to the one-third power law discussed in Section 2.5, the 
voltage waveform across the rectifiers of the constant-current 
modulator, using sinusoidal carrier, would be of the form 
sin'/3 wt; thus the conditions at the carrier zero, as regards slope 
of waveform, may be approximated by using a constant-voltage 
source at three times the carrier frequency, i.e. 9 kc/s in this case. 
It is unlikely, therefore, that the cross-over peaks are due to 
minority-carrier storage. 

Reactive effects could be produced either by the final-stage 
anode inductors or by the output transformer. In order to 
eliminate one of these unknowns, the output transformer was 
replaced by a differential amplifier whose input was taken from 
low-value resistors connected in place of the primary winding of 
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the output transformer. A differential amplifier for this purpose 

must be capable of meeting a very rigorous specification. Its 
‘Tejection ratio, defined as the anti-phase gain divided by the 
in-phase gain, must be at least 90dB, as the constant-current 
modulator was expected to give a possible carrier rejection of 
this order. The differential amplifier developed by Richards,>* 
was modified by the use of valves of higher mutual conductance 
| (EF 91) to extend the bandwidth to 100 ke/s in order to reproduce 
the cross-over peaks should they occur. By a certain amount of 
valve selection, a rejection ratio of over 90 dB was obtained, and 
| the anti-phase gain, measured at 3 kc/s was 3250. Owing to the 
high anti-phase gain, 15-ohm resistors in the modulator output 
‘were found to be sufficient for satisfactory operation. This value 
| of resistance, when compared with the rectifier forward resistance, 
|also approaches quite well the assumption of a short-circuit 
| made in Section 2. 

The use of the differential amplifier to replace the output 
| transformer, while not completely eliminating the peaks in the 
(carrier zero region, considerably reduced their magnitude. 

A set of oscillograms showing the variation of leak waveform 
‘with variation in peak carrier current is reproduced in Fig. 10. 
'The peaks in the carrier zero region increase with increase in 
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carrier current. Now, if no reactive or minority-carrier storage 
effects were present, the effect of increasing the peak carrier 
current should be to compress, in time, those parts of the leak 
waveform near the carrier zeros, and to insert new parts in the 
region of the carrier peaks; thus the magnitude of the cross-over 
peaks should be unaffected by changes in peak carrier current. 
That this is not so again indicates an inductive effect which must 
now be due to the anode inductors. It is reasonable to suppose 
that if peaks occur without the output transformer, they would 
be, effectively, amplified with it. Another interesting feature of 
the oscillograms, is the ‘squareness’ of the leak waveform, 
disregarding the carrier zero regions, which agrees well with the 
theoretical treatment. 

The four rectifiers used in the modulator, to obtain these 
oscillograms, were not well matched, having forward resistances 
at 10mA ranging from 56 to 120 ohms, and reverse resistances 
at —1 volt ranging from 24 to 280 kilohms. When using selected 
rectifiers to obtain very low carrier leaks, as discussed later, the 
magnitude of these peaks relative to that of the mid-cycle leak, 
was much greater, but as the output leak agreed quite well with 
that predicted from rectifier characteristics using eqn. (16), it 
appears that the peaks contribute very little to the magnitudes 
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Fig. 10.—Waveform of carrier leak at various values of peak carrier current, 


Oscillogra 


ms (a) and those for the cross-overs are with modulator balanced for minimum second harmonic at the stated current. 


Oscillograms (b) are with modulator balanced for minimum second harmonic ata peak current of SmA. 
Oscillograms of cross-overs have deflection sensitivity increased by 70% and time scale expanded ten times. 
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Fig. 11.—Variation of r.m.s. leak output current with peak carrier 
current. 
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fundamental, and harmonic components of this waveform is 
given in Fig. 13 as a function of drive-current unbalance. It is 
apparent from these results that the magnitudes of the d.c. and 
harmonic components are very sensitive to current unbalance, 
whilst those of the fundamental and third harmonics remain 
practically unchanged, even over the relatively large unbalance 
considered, namely +10%. These latter components do change 
a small amount, in such a manner that they are proportional to 
the sum of the carrier currents from each driving circuit. It 
seems, therefore, that the modulator achieves, inherently, a low 
value of carrier fundamental leak which is incapable of being 
reduced or increased by changes in the gains of the driving 
circuits. Thus the gain stability of these circuits is only important 
in the rare cases where balance to the second, or higher-order 
even harmonics, is required. 


(3.5) The Use of a Balancing Voltage-Divider in the Output 


In the conventional ring modulator, it is normal practice to 
insert a voltage divider between the two halves of the output- 
transformer primary winding, as is discussed in the Introduction. 
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of the lower harmonics of the leak output. Thus, for most 
practical purposes, it should be fairly safe to neglect the peaks 
and consider only the mid-cycle leak. 


(3.3) Variation of Leak with Carrier Current 


The oscillograms of Fig. 10 indicate that the value of the 
carrier current does not appreciably affect the setting of the 
generator balancing controls, as is evidenced by the waveforms 
(a) and (6). The variation of the fundamental and harmonic 
components of the leak when the modulator is balanced for 
minimum second harmonic at different carrier currents is shown 
in Fig. 11. From these results it is apparent that the carrier 
rejection improves slightly with increase of carrier current. 


(3.4) Effect of Unbalance in Carrier-Drive Currents 


The oscillograms of Fig. 12 show the effects of resistive and 
capacitive unbalance of the driving-circuit output currents on the 
leak waveform. The variation of the magnitudes of the d.c., 
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Carrier current, 5mA (peak). 
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Fig. 13.—Variation of components of leak output current with resistive 
unbalance of the drives. 


Carrier frequency, 2930c/s; carrier current, 5mA (peak). 
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It may be that such a device, if used in the constant-current 
modulator, would enable the leak to be reduced to an even 
lower level than that predicted. It would be expected that, if 
a lower level was achieved, it would not be time stable, as the 
setting of such a voltage divider would depend on the rectifier 
characteristics in a similar manner to that of the conventional 
ring modulator. This idea was tried experimentally by inserting 
a low-value voltage divider between the terminating resistors, 
the wiper being connected to the ‘earthy’ end of the carrier 
source. As far as could be ascertained, adjustment of this 
voltage divider was incapable of achieving a lower value of 
fundamental leak than was obtained with equality of the ter- 
minating resistors. The difficulty of measuring the small 
differences in resistance is considerable, owing to the low value 
of the total resistance. The fundamental leak obtained with 
transformer termination did, however, agree with the minimum 
value obtained with voltage-divider termination, and as the 
transformer was known to be very accurately balanced, owing 
to the bifilar method of winding used, it is reasonable to suppose 
that this minimum does occur with equal terminating resistors. 
The voltage divider is a useful device, not for reducing the leak 
below the predicted value, but for adjusting the terminating- 
resistor equality. 

An interesting result obtained from the use of the voltage 
divider is that its variation within reasonable limits affects the 
amplitude of only the odd harmonics of the leak, and not that 
of the even harmonics. 


(3.6) The Use of Selected Rectifiers 


Groups of four rectifiers were selected, for their low values of 
A , A, and r;/ry, from a batch of 20 germanium diodes. These 
groups were then substituted in the modulator, and, in order to 
verify the theoretical results for worst leak with rectifiers whose 
characteristics lie within given limits, all six possible rectifier 
combinations were investigated. 


For the first group, (a), the measured parameters were 
Ay= 0°13, A, =0-3, and refr; = 0-46 x 10-3 


Using eqn. (16), the theoretical maximum value of fundamental 
leak is obtained as 5-12 x 10~4ZJamp, ie. the fundamental 
leak is theoretically 66dB below the carrier level. The worst 
value of fundamental leak measured was 71 dB below, and one 
combination gave a leak which was 77 dB below, the carrier level. 


The second group, (0), investigated had the parameters 
A; = 0-25, A, = 0-28, and r/r, = 1-75 x 10-4 


giving a theoretical worst fundamental leak level of 72-6dB. 
In this case, one rectifier combination gave a leak level of 70-5 dB, 
i.e. 2-1dB worse than that indicated theoretically, but this 
discrepancy is not large. One combination of this group gave 
an output leak whose fundamental component was 92:2dB 
below the carrier level. 

From these results it is apparent that selection of rectifiers can 
easily lead to fundamental leak levels in the region of 70dB 
below the carrier level and, with rather more care, it is possible 
to achieve a level of 90 dB. 

With any combination of these two groups, the fundamental 
iwak was substantially independent of gain variations in the 
carrier drive circuits. 


(3.7) Stability Tests 


The temperature stability of the modulator was checked by 
enclosing the rectifiers in a temperature-controlled oven. For 
group (a), a warming-up period of 20min was allowed with the 
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rectifiers at a temperature of 22°C. The temperature was raised 
to 31-5°C and allowed to remain at that value for 10min, in 
order to avoid the effects of uneven temperature distribution, 
after which time readings of leak were recorded. The tem- 
perature was then lowered to 22°C, and after a further period 
of 10min, readings were again taken. The following results 
were obtained: 


LEAK CURRENT, MICROAMP (R.M.S.) 


3rd harmonic 4th harmonic 


2nd harmonic 


Temperature Fundamental 


Thus, increasing the temperature by 9-5°C has raised the 
fundamental output leak from 65:5dB to 62:3dB below the 
carrier level. 

With group (b), the fundamental leak at an initial temperature 
of 21°C was 91 dB below the carrier level. As the temperature 
was slowly raised the leak current decreased, giving a minimum, 
which was 97dB below the carrier level, at 25°C. Further 
increase in temperature caused the leak current to rise so that 
at 31°C the level of fundamental leak was 83dB below the 
carrier level. 

Long-term time-stability tests have not been made, as the 
temperature-stability tests indicate that the changes in rectifier 
characteristics which would normally be expected, either due to 
age or temperature variations, have no great effect upon the 
level of balance obtained. 


(4) CONCLUSIONS 


The use of constant-current carrier drives to each half of a 
split-ring modulator has removed the first-order unbalance 
etfect due to differences of rectifier forward resistance. The 
ultimate limit to the balance obtainable is now set by the mag- 
nitude of the ratio of forward to reverse resistance of the rectifiers 
multiplied by the magnitudes of the differences in both the 
forward and the reverse resistances. Due to the effects discussed 
in Section 2.5, the difference in the reverse resistances is usually 
the more important. 

The important conclusion is reached theoretically, and verified 
experimentally, that the stability of the fundamental component 
of the carrier leak is not affected by changes in gain of the 
constant-current drives, and, for a high balance to this funda- 
mental component, the only requirements of these drives is that 
they should be linear and have an output impedance which is 
higher than the maximum reverse resistance of the rectifiers to 
be used. 

In practice, peaks occur in the leak waveform in the region of 
the carrier zero which are not explained by the circuit theory 
given here. These peaks are apparently due to inductive effects 
in the circuit, and although they have large amplitudes relative 
to the mid-cycle leak, they contribute very little to the magnitudes 
of the lower harmonics of the leak, and their effect should be 
negligible for the majority of practical purposes. 

The setting of the driving-circuit balancing controls is not 
greatly affected by the magnitude of the carrier current used, so 
that the stability of the carrier source is not important. 

A useful criterion of temperature stability is the alteration in 
temperature required to double the leak current. This was found 
to be of the order of 3 or 4°C for the best levels of balance 
obtained, and correspondingly greater, as would be expected, 
for the poorer levels. 
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As regards the applications of this modulator, it is clearly 
most directly valuable in communication equipment, and more 
particularly in measuring instruments. It may be possible to 
apply it to the chopping of d.c. and low-frequency signals—with 
great advantages for sensitive ‘d.c.’ amplification—but this has 
not yet been fully investigated. 

Summarizing: the constant-current modulator is capable of 
achieving much lower levels of carrier leak—of the order of 
90 dB below the carrier level—than have previously been obtained, 
with the added advantage of good stability. 
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SUMMARY 

Theory shows that a simple binary pulse-code modulation system 
does not make the maximum use of a communication channel of given 
bandwidth and signal/noise ratio. It is suggested that an improvement 
in output signal/noise ratio can be obtained by modifying the binary 
code so that errors in its different digits contribute equal amounts of 
noise power to the output. 

The noise contributions of the digits can be equalized by using a 
greater number of pulses to transmit the more significant digits. 
However, the resulting increase in bandwidth introduces additional 
noise which more than offsets the reduced error liability of the digits 
and so causes a worsening of the output signal/noise ratio. An 
attempt to equalize the noise contributions of the digits by using a code 
with a different base for each digit also fails. 

It is possible to make the digits of a binary code group contribute 
equally to the output noise by varying the pulse height or length from 
digit to digit. An improvement in output signal/noise ratio is then 
obtained over a range of input signal/noise ratios. The system using 
pulses of different heights is less complicated than that using different 
pulse lengths; however, it is more complex than a system using the 
simple binary code, and it is doubtful if the improvement in signal/noise 
ratio justifies the additional complexity. 


LIST OF SYMBOLS 


A = Height of quantizing step, volts. 

C = Channel capacity, bits/sec. 

f = Cut-off frequency, c/s. 

J, = Bandwidth, c/s. 

K = A constant (~ 10). 

k = An integer. 
P,, = Noise power, watts. 

n = Average number of effective noise bursts per second. 
P, = Probability of error in rth digit. 
r = An integer. 
= Signal power, watts. 
= Duration of rth digit pulse, sec. 
= Slicing voltage for rth digit, volts. 
= R.M.S. output noise voltage, volts. 
= R.MS. output signal voltage, volts. 
=Improvement in signal/noise ratio due to modified 
code, dB. 

pt = Number of digits in code group. 
o =R.MSS. input noise voltage, volts. 


PNSNAO 


(1) INTRODUCTION 


In pulse code modulation (p.c.m.), the signal to be transmitted 
is sampled at regular intervais and the amplitude of each sample 
is sent by means of a coded group of pulses. The usual method 
of coding is to represent each sample by a group of pulses, each 
ef which can have only two values: either ‘on’ or “Glits, ZAX 
group of n pulses can thus represent 2” values. Each group 
represents the sample value in the binary scale of numbers, which 
uses only two symbols, ‘0’ and ‘1’. i 

At the receiving end, the radio-frequency carrier af any) is 

amplified and demodulated. The resulting pulses are applied 
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to a slicing circuit, which operates and produces a regenerated 
pulse whenever the voltage applied to it exceeds a critical value 
which is chosen to be about half the nominal pulse height. Noise 
on the transmission path can only affect the output if it is of 
sufficient amplitude to cause maloperation of the slicing circuit. 
If the input signal/noise ratio is sufficient for this to happen very 
infrequently, the output signal is substantially free from 
interference. 

Oliver, Pierce and Shannon! have shown that the rate at which 
a p.c.m. system can transmit information is 


12 2) 


KP, (1) 


C = f, log, (1 AF 
Now, the maximum rate at which the channel can transmit 
information? is 


P, 
C = fylog, (1 +5) ns so) 


Egns. (1) and (2) are of the same form. Thus, the use of p.c.m. 
allows bandwidth and signal/noise ratio to be exchanged accord- 
ing to the same logarithmic law as an ideal communication 
channel. However, the power required by the p.c.m. system is 
K*/12 times that theoretically required by an ideal system. It 
should thus be possible to obtain a method of coding which 
provides the same output signal/noise ratio as binary p.c.m. with 
only about one-eighth of the transmitted power. Alternatively, 
for the same transmitted power an improved output signal/noise 
ratio should be obtainable. 

In the simple binary code, each digit is equally liable to error 
due to noise on the transmission path; however, the amounts of 
noise which these errors cause at the output of the decoder are 
unequal. Thus, in a 5-digit code an error in the most significant 
digit results in an error in the amplitude of the output signal 
16 times as great as that caused by an error in the least significant 
digit. The signal/noise ratio of the system should be improved 
by making the most significant digits less liable to error than 
those of less significance, so that they contribute equally to the 
output noise. Possible methods of doing this are investigated 
in this paper. 


(2) SIGNAL/NOISE RATIO IMPROVEMENT IN P.C.M. 
The improvement in signal/noise ratio obtained by the use of 
simple binary p.c.m. has been investigated by Clavier, Panter 
and Dite.* Their analysis will now be extended to cover the 
case of a modified code in which the digits are not equally liable 
to error due to noise on the transmission path. 
If a binary code group has yz pulses, the contributions which 
the pulses make to the decoded output voltage are 
ASDA DA a in, 2 Ae | 221A 
If the probability of an error in the rth pulse of the group is p, 
the mean-square noise voltage produced at the output is 
U 
Ves > p 2 a ee 


r=1 


The probability of error, p,, depends on the average number of 
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times per second, n(V), that bursts of noise exceed the voltage, 
V, at which the pulses are sliced to determine whether they are 
present. Any positive noise burst which exceeds V and occurs 
in the absence of a code pulse will cause a spurious pulse to be 
registered. However, any positive noise burst which occurs 
during a code pulse merely increases its height and so does not 
cause any error. If it is assumed that the presence and absence 
of a code pulse are equally likely, then, on the average, half the 
positive noise bursts exceeding V will result in errors. In addi- 
tion, any negative noise burst exceeding V will be assumed to 
cancel a code pulse which is present, but will not cause error 
when it is absent. Thus, half the negative noise bursts of 
amplitude exceeding V also cause errors. Consequently, the 
average rate at which errors occur (due to both positive and 
negative bursts of noise) is n(V). 

If the noise bursts are independent of each other and have a 
Poisson distribution, the probability of k bursts occurring in an 
interval T is 
eo 


k! 


p(k) = [nV)T] (4) 


If the interval T is that of one digit pulse, only one error can 
occur (k = 1), and if the errors are relatively infrequent 
nV)T <1. 

Eqn. (4) then becomes 


P, =~ n(V,)T, (5) 

Rice* has shown that, if random noise is applied to a circuit 
with an ideal low-pass frequency characteristic, the number of 
noise bursts per second which exceed V is 


no) = Lge 


(6) 


Substituting for p, in eqn. (3) by means of eqns. (5) and (6) gives 


V2 = AP x ratte e V2[202 (7) 


r=1 


This is the output noise voltage for the general case of a binary 
code group in which the pulses may not be equally liable to 
error. 

If the modulating signal is a sine wave, the greatest possible 
amplitude is 4A(2"— 1), The mean-square signal output 
voltage is then - 


V2 = A24 — 1/8 (8) 


In the simple binary code, all the pulses are of equal height 


(2V,) and duration (7;) and equally liable to error. Eqn. (7) 
then gives 
ap u. 
V2= goto — V2 /202 2(r—1) 
» V3 7 >») 2 
S,TyA* 
= (qu — 1)2°8" 6 ¥32)20? 
Ae Daa ©) 

From eqns. (8) and (9) the output signal/noise ratio is 

V2" 237/302 1) 

a 21962 
V2, SAT vee 1) ua Wee [20; (10) 


This result was obtained by Clavier, Panter and Dite.3 
If errors in each of the code pulses are to make equal contribu- 
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tions to the output noise power, then, from eqn. (3), we require 


P= hie ae (11) 


If this condition is satisfied, eqn. (3) becomes 
V2 = wpa? 

Substituting for p, by means of eqns. (5) and (6), 

2 
ee = V2 [202 
From egns. (8) and (12) the output signal/noise ratio is 
V2 _ 32 — 1) 
V2 Buf T; 


Arranging errors in the code pulses to contribute equal 
amounts to the output noise has changed the output noise power 
from the value given by eqn. (9) to that given by eqn. (12); 


Veoh Os meh ex Ve dhe avy 
ee (44 a 1)f,T, 20? 20% 


K 
The parameters /,7, and f,7;, determine the shape of the pulses: 
an acceptable minimum value is 0-5. It will be assumed that 
the same value is used in each case. Putting f;7; = /,7, and 
expressing this power ratio in decibels, the improvement in output 
noise is 


pe (12) 


€ Vi [202 


(13) 


1.€. 


A= 
A = 10 log (——) 
10,3 


VP . 
233) logioé decibels (14) 
(3) CYCLIC PERMUTED CODE 


The cyclic permuted code was proposed by Gray® in order to 
reduce errors in the coding process. An example is shown in 


Table 1. 

Table 1 
Magnitude Representation 
of sample in c.p. code 

0 0000 
1 0001 
2 0011 
3 0010 
4 0110 
5) 0111 
6 0101 
il 0100 
8 1100 


This code has the advantage for coder operation that a unit 
change in the value of the sample to be coded can cause only 
one of the code digits to change its value. However, changing 
the value of one of the code digits can cause a change of more 
than one unit in the value of the decoded sample. For example, 
an error which changes 0100 to 0000 causes an error of 7 units 
in the decoded output. Thus, the c.p. code provides no advan- 
tage over the simple binary code as regards errors caused by 
noise on the transmission path. 


(4) BINARY CODE WITH UNEQUAL NUMBERS OF 
PULSES PER DIGIT 
One method of making the more significant digits less liable to 
error is to represent them by more than one pulse. If a digit is 
represented by a single pulse and an effective noise pulse is also 
present, four possibilities occur, as shown in Table 2. Thus half 
the possibilities result in an error. 
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Table 2 


Signal pulse 
present or 
absent 


Polarity of 
noise pulse 


Right 
or wrong 


If a digit is represented by two pulses, either no noise or one 
or two noise pulses may be present. If noise is present the 
possibilities which arise are shown in Table 3. The number of 
errors which occur is seen to be the same whether a single pulse 
is accepted as the signal ‘1’ or whether two pulses are required 


Table 3 


Right or wrong 


Polarity of 


Signal pulse > 
noise pulses 


present or absent Rosi 


(0) 


PA 

3 00 0+ 
a 00 O= 
ZB 11 O+ 
7 11 o— 


2 Noise pulses 


(a) If presence of a single pulse is accepted as the signal ‘1’. : 
(6) If two pulses are required to produce ‘1’ and a single pulse is regarded as ‘0’. 


° 


to produce ‘1’ and a single pulse is regarded as ‘0’. For both 
(a) and (b), only one-quarter of the single noise pulses produce 
an error in the output and one-half of the noise pulse-pairs 
produce errors. 

If a digit contains three pulses and the presence of two pulses 
is accepted as the signal ‘1’, a single noise pulse cannot cause 
error. By drawing up a Table similar to Table 3 it can be shown 
simply that one-third of the noise pulse-pairs and one-half of 
noise triple pulses will cause errors. Similarly, if four pulses are 
used, and two or three pulses are required to produce the signal 
‘1’, a single noise pulse cannot cause an error and one-sixth of 
the noise pulse-pairs will do so. The results obtained for digits 
containing up to 10 pulses are summarized in Table 4. 

This Table shows that a change from an odd number of pulses 
to the next greater even number halves the rate of error, whereas 
eqn. (11) requires each digit to be followed by one with one- 
quarter of the error rate. This suggests that only the odd 
numbers of pulses should be used. Table 4 also shows that ifa 
group of 2k — 1 pulses is used, k noise pulses are required to 
produce an error. 

If the average number of noise pulses per second is m, the 
srobability of k pulses in time T is given by the Poisson proba- 
bility distribution [eqn. (4)]. If the duration of each code pulse 
is T, the probability of k noise pulses occurring during the time 


ef 2k — 1 code pulses is 


Cre Wakes 
Px [ A € (2k—1)nT 
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Table 4 


Effect of noise pulses 


Single pulses cause error + of the time 
Single pulses cause error + of the time 
2 pulses cause error 4 of the time 
Single pulse causes no error 

2 pulses cause error 4 of the time 
Single pulse causes no error 

2 pulses cause errors + of the time 

2 pulses cause no error 

3 pulses cause error + of the time 

2 pulses cause no error 

3 pulses cause error 4 of time 

3 pulses cause no error 

4 pulses cause error + of time 

3 pulses cause no error 

4 pulses cause error 5 of time 

4 pulses cause no error 

5 pulses cause error 3 of time 


4 pulses cause no error 
5 pulses cause error ="; of time 


[2(k + nT ]k+1 
~ &+D)! 


Pe Oke nme ae 
Pe [Qk —DnTFK +)! 


_ Qk + DEH (CT) 
QRS T)MR-F 1) 


Eqn. (15) shows that if pp.,; = 4p,, the value of nT (i.e. p;) 
is determined; moreover, it cannot be the same for each value 
of k. However, p,; only varies from 0-055 to 0:044 as k varies 
from 1 to infinity. Thus, if p; is about 0-05, eqn. (11) will be 
approximately satisfied and the amounts of noise introduced by 
the different digits will be nearly equal. 

If p; = 0-05, from eqns. (5) and (6), 


20 
V3" iT 
The different digits thus only contribute equal amounts to the 
output noise power when the input signal/noise ratio is that given 
by eqn. (16). The mean-square output noise voltage is then 
given by eqn. (12). 

The modified code requires jz times as many pulses per code 
group as the simple binary code; for example, a 7-digit code 
requires no less than 49 pulses. Consequently, the bandwidth 
of the system must be increased pz times (f/; = pf,), and the 
input noise power increases in the same proportion (o7 = [Loz : 
The probability of receiving noise pulses is thus increased, and 
this offsets their reduced effect when they do occur. If both 
systems use pulses of the same amplitude (V, = V,), the ratio 
of the output noise powers, from eqns. (9) and (12), 


Similarly, Pryi= g7 (2k+1)nT 


Therefore 


if nT <1 (15) 


eV3/203 = 


(16) 


2 Pa) 
V2 (4#—1)f,T, » 203 \o2 
3 Vi . 
(44 — 1) exp 203 1) 1 fi 1 fi, b 
Substituting for V?/207 from eqn. (16) gives 
Vee 3p ( 20 (=D 
ahs oe ji = i T; ) 
V2 gu — 1 37h! 
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Consequently, an improvement in the output signal/noise ratio 
is only obtained if 


The minimum value for /,7; is 0°5, so we require 


44-1 (10 ee 
3 (73 


which is not satisfied by any positive integral value of uw — 1. 
The increase in the number of effective noise pulses thus out- 
weighs the reduced effect of the noise pulses and the signal/noise 
ratio at the output is actually worsened. It is clear that very 
inefficient use is being made of the additional pulses transmitted. 
If it is permissible to use more pulses than the simple binary 
code requires, it is clearly better to abandon binary coding com- 
pletely and adopt some form of error-correcting code® for the 
group of code pulses rather than digit by digit. 


(5) CODE WITH DIGITS OF UNEQUAL BASE 


A possible method of equalizing the amount of noise power 
introduced by errors in the different digits is to make the less 
significant more liable to error than in the case of the simple 
binary code. If this enables the code to be sent over a channel 
of smaller bandwidth an improvement in output signal/noise 
ratio may result. In a group of code pulses, the first pulse could 
have only two values (on and off), the second could have three 
(full amplitude, half amplitude and off), the next four, and so 
on. Alternatively, the first pulse could have two values, the 
second four, the third eight, etc. In generai, the pulse repre- 
senting the rth digit can have n, values. The values for the 
various pulses in the code group and the signal amplitudes they 
represent are shown in Table 5. 


Table 5 


Pulse values Signal amplitude values 


Oil 0, 4 
CDV C sa) (Ce, SO ep 
... (m2 — 2){(m2 —'1), 1 (n — 1)/2nz 


0, 1/(73 = 1), 2/(n3 — 1) 0, 1/2n2n3, 2/2n2n3, 
ail stielisl (13 == 1)/2n2n3 
0, 1/G. —1), 2/@ey, — 1) 0, 1/2non3... Mu; 
a DILTON ents 
seis (My, — 1) (270975 «My 


A noise pulse which coincides with the first digit will introduce 
an error if its height exceeds half the pulse voltage and the error 
voltage will be of value one-half. A noise pulse which coincides 
with the second digit will introduce an error voltage of 1/275 
if its height exceeds 1/(n. — 1), an error of 2/2n, if its height 
exceeds 2/(n, — 1), and so on. Similarly for the subsequent 
digits as shown in Table 6. 

Let the probability of a noise pulse exceeding amplitude x be 
p(x). Provided that the input signal/noise ratio is adequate Gics 
exp V?/20? > 1), we can assume that 


1 z 3 5 2n, — 3 
al a, = | > ol ag = 5} Al za 5 lees =| 
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P{ 1/22 — 1)] = pG) exp [ 


P[1/2(73 — 1)] = p[1/2(@. — 1)] exp | 


P{ 1/2, —1)] =pl1/2(,_;—1)Jexp | 
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Table 6 


Amplitude exceeded 


~ Resultant error voltage 
by noise pulse 


1 } 


1/2(n2 — 1) 
3/2(“2 — 1) 


1/2n2 
2/2n2 


(n2 ~ 1) fan 


1/2n2n3 
2/2n2n3 


(Ons = 3)0Gn 2s 


3 1/2@3 — 1) 
3/2(n3 — 1) 


(m3 — 1)/2n2n3 


1/2non3... ny, 
2/2n2n3... Ny 


(Ons = 3c) 


iP 
is ocr a 


Qny, — 3)/2(ay, — 1) (my — 1)/2ngn3.. . Ny, 


The mean-square noise voltage at the output is thus 


nee {ao PUL = 9) , alt/20s — 0] 
Te Teg 3 2>ngnt, 


oe PLL/2G,. — zl 


22052) an 
2*n5n, .. . i, 


To equalize the noise power contributed by errors in the different 
digits we therefore require 


P[1/2(nz — 1)] = 3 pG) 

PL1/2(5 ae 1)] == n3 p[1/2(ny == 1)] 

Pii/2m, — 1)] = n2.p[1/2m,_, — 1D] 
Now, from eqn. (6), 


il V2 
(ity = 12a 2e2 


ae 4 y2 
(1, — 1) (m3 — 1)? |2e 


Therefore 
1 VA 
] =a 
Beda | (1a — le 
1 PAwihre 
log ny = | — = 
edie Gn = Daas | 202 
1 1 V2 
log 74, = | — 
*  L@y_1— D2 (yy = 122.4202 
hence 72, n3,..., my, can be evaluated. 


It is seen that the values of ny, 13,..., m, depend on V2)20%% 
moreover, they are not integers. It is therefore impossible to 
equalize the noise contributions of the different digits if these 
can have only an integral number of values. 


1 1 v2. 
(a, -1—1?  @,—1) [202 


| 
| 
| 
| 
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(6) BINARY CODE WITH PULSES OF UNEQUAL HEIGHT 


Another method of equalizing the noise power introduced by 
errors in the different digits is to transmit the digits by pulses of 
equal duration T but different height (2 V,). The more significant 
digits are sent by larger-amplitude pulses than the less significant 
digits, so the former can be made less liable to error than the 
latter. At the receiving end of the modified system, the pulse 
corresponding to each digit must be sliced by a discriminator 
circuit set to the appropriate level. This may be done, as shown 
in Fig. 1, by using a separate discriminator for each digit and 


SLICING 
CIRCUITS 


OUTPUT TO 
DECODER 


Fig. 1.—Receiving end of p.c.m. system using pulses of unequal height. 


gating each pulse to the appropriate circuit. Alternatively, a 
single discriminator may be used (e.g. a Schmitt trigger circuit) 
whose reference voltage is changed for each digit to be received. 
_ The amplitudes of the more significant pulses can be larger than 
that of the pulses in a simple binary p.c.m. system which uses 
the same mean transmitter power. It should thus be possible to 
obtain a better signal/noise ratio by using the modified system. * 
The probability of error in each code pulse is given by eqn. (5). 
For the errors in the code pulses to make equal contributions to 
the output noise, the ratios of their error probabilities to that of 
the least significant pulse is given by eqn. (11). From these two 
equations 
oP 
5 [exp (— V2/207)]4"-1 = 2 exp (— V?/207) 
Therefore V2 = V? +2 dog. 4)(r — Io? 


= V2[1 + 2-772r — 1)07/ V2] (17) 


The relationship between the pulse heights thus depends on 
the noise level. If the pulse heights are adjusted for one par- 
ticular value of o, at other noise levels errors in the pulses will 
no longer contribute equal amounts of noise to the output. 

If the amplitude of each pulse in the group is 2V,, r= 1, 
..., , the mean-square voltage of the group of code pulses is 
4V2, where 

1 # 
ESE ADEE Betts (a 1)o7/V 7] 
-Hr=1 
This is equivalent to the mean-square voltage (4 V P)of each pulse 
in a simple binary p.c.m. system which uses the same transmitted 
power as the modified system. 

By substituting from eqn. (18) in eqn. (14), the improvement 

A in signal/noise ratio obtained by using the modified code is 


3 


The improvement obtained for various numbers of digits in the 


code group is shown in Table 7. 


i is limi mean power, 

* If the rating of the transmitter is limited by peak power rather than c 

> will not be aaciibie to obtain an improved signal/noise ratio by means of the modi- 
“<d code, but merely a reduction of mean transmitted power. 


Vou. 105, Parr C. 


(18) 


(19) 


) — 3-01( — 1) decibels 
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Table 7 
No. of digits, u Improvemient, A 
dB 
1 0 
2 1 
3 25 
4 4:3 
5 6°3 
6 8-4 
7 10:8 
8 13*3 


The improvement shown in the Table is obtained only for the 
noise level for which the modification is made. If the noise level 
changes, the improvement will be less. If the system is adjusted 
for a noise voltage 9, the relationship between the pulse heights 
is given by eqn. (17) with o = op. At other input noise levels 
the output noise level is given by eqn. (7). With an r.ms. 
input noise voltage o’ the mean-square output noise voltage is 
thus 


: AAT, & 
V2 = ee i »» 22°—) exp (— V2/20”) 
AAf T. v. 
= at ‘Texp (— V?/20)]  22¢-D [exp — 1-386(r — 1)] 
V 1 ai iae 
(oo/o ) 
AAT, es ee i eee) 
= Be [exp (— V?]20? yas ganar 


The mean-square output voltage with the simple binary code, 
V2, is given by eqn. (9), and Vi is given by eqn. (18). 
Therefore 
La 4v—o§19"2) 3 x¢ g(u—1)03/20’2 


Veo 4u—] 


jez 4l—a5/o? 


Expressing this ratio in decibels gives as the improvement in 
output signal/noise ratio 


(4% — 1)(1 — 4126/07) 
3[1 — 44 0-5/0] 
— 3-01(u — 1)o)/o” decibels (20) 


For input noise voltages exceeding o) the improvement, A, 
obtained, although smaller, is still positive. For input noise 
voltages less than op, the improvement is again less than the 
maximum and vanishes when the noise power is approximately 
halved. At still smaller input noise powers the simple binary 
code is better than the modified code. In Fig. 2, curve (a) shows 
how the output signal/noise ratio varies with input signal/noise 
ratio for a simple binary code system using 5 digits. Curves (d) 
and (c) are for the modified system adjusted to provide maximum 
improvement in output signal/noise ratio at input signal/noise 
ratios of 10dB and 13 dB respectively. 


A = 10 logjo 


(7) BINARY CODE WITH PULSES OF UNEQUAL LENGTH 


If the digits are transmitted by pulses of different duration, 
they can be received by circuits of different bandwidth and so be 
subject to different amounts of noise. The more significant 
digits can be sent by longer pulses than the less significant digits: 
they can therefore be received by circuits of smaller bandwidth, 
which thus receive less noise than the shorter pulses of the less 
significant digits. The different pulses can be of equal height 
and applied to the same slicing circuit as shown in Fig. 3. 

The probability of error in each code pulse is given by eqn. (5). 
For the errors in the code pulses to make equal contributions to 

14 
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OUTPUT SIGNAL/ NOISE RATIO, dB 


4 6 8 10 12 14 16 18 
INPUT SIGNAL/NOISE RATIO, dB 


Fig. 2.—Signal/noise ratio improvement in p.c.m. systems. 


(a) Using simple binary code. — ; ; f f . 
(b) Code modified to give maximum improvement at 10 dB input signal/noise ratio. 
(c) Code modified to give maximum improvement at 13 dB input signal/noise ratio. 


RECEIVING 
CIRCUITS 


SLICING 
CIRCUIT 


INPUT OUTPUT TO 


DECODER 


Fig. 3.—Receiving end of p.c.m. system using pulses of unequal length. 


the output noise, the ratios of their error probabilities to that of 
the least significant pulse is given by eqn. (11). From these two 
equations 


4r- ELS V 2/202 =f; Lem V 2/202 


The noise power for each digit is proportional to the bandwidth 
of its receiver, and it will be assumed that £7, = /,T;; 


i.e. ee ae 
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Therefore 
4-1 exp (— V2T,/203T,) = exp (— V?/207) 


VY /G 
— mei — 1 = For 1 lo eS 4 
20? & ( igs 


T. =T, [1 42-77 — Dov] . (21) 


The similarity between eqns. (21) and (17) should be noted. As 
in the system with variable pulse height, the relation between the 
pulses depends on the noise level. If the pulse lengths are 
adjusted for one particular value of o,, at other noise levels errors 
in the pulses will no longer contribute equal amounts of noise 
to the output. From eqn. (21), the mean pulse duration is 


i] 2 

T, =— dT, 

K 1 
= T,[1 + 1-386(4 — 1)ot/ v7] 

This is the duration of each pulse in a simple binary p.c.m. 
system which uses the same transmitted power as the modified 
system. The bandwidth required by the simple binary p.c.m. 
system is f, =/,7,/T,, and the mean-square input noise voltage is 

oS te ot holhi = ofT,/T, 
o7/[1 + 1:386(% — Nor fv] 
The improvement in signal/noise ratio due to the modified code 


is obtained by substituting from eqn. (22) in eqn. (14), which 
gives 


(22) 


A = 10 logy (eo) — 3-01(y. — 1) decibels 


This result is identical with eqn. (19). The improvement in 
signal/noise ratio which can be obtained by varying the pulse 
length is thus equal to that obtainable by varying the pulse 
height and is given by Table 7. 


(8) CONCLUSIONS 


It is possible to obtain a better output signal/noise ratio than 
that provided by simple binary p.c.m. by making the more 
significant digits less liable to error than the less significant ones 
so that they contribute equally to the output noise. This can 
be done by using a greater number of pulses to transmit the more 
significant digits. However, the resulting increase in bandwidth 
introduces additional noise which more than offsets the reduced 
error liability of the digits and so causes a worsening of the 
output signal/noise ratio. An attempt to equalize the noise con- 
tributions of the digits by making each use a code with a different 
base also fails. 

It is possible to make the digits of a binary code group con- 
tribute equally to the output noise by varying the pulse height 
or length from digit to digit. An improvement in output 
signal/noise ratio is obtained over a range of input signal/noise 
ratios. At high input signal/noise ratios, however, the modified 
system gives more output noise than the simple binary code 
system, but the output noise is then usually so low that the 
deterioration is not important. The system using pulses of 
unequal height is less complicated than+that using pulses of 
unequal length, which requires separate receivers of different 
bandwidth for the different digits. However, the system using 
pulses of different height is more complicated than a simple 
binary system, and it is doubtful if the improvement in signal/noise 
ratio obtained justifies the additional complexity, particularly in 
view of the already excellent noise performance of simple 
binary p.c.m. 


FLOOD: NOISE-REDUCING CODES FOR PULSE-CODE MODULATION 


(9) ACKNOWLEDGMENTS 


Acknowledgment is made to Siemens Edison Swan Ltd. for 
permission to publish the paper. Thanks are due to Mr. J. E. 
Halder for calculating the results shown in ions 


(10) REFERENCES 


(1) OLIVER, B. M., Pierce, J. R., and SHANNON, C. E.: ‘The 
Philosophy of P.C.M.’, Proceedings of the Institute of Radio 
Engineers, 1948, 36, p. 1324. 

(2) SHANNON, C. E.: ‘The Mathematical Theory of Communica- 


397 


tion’, Bell System Technical Journal, 1948, 27, pp. 379 
and 623. 

(3) Cravigr, A. G., PANTER, P. F., and Dire, W.: ‘Signal to 
Noise Improvement in a Pulse-Count Modulation System’, 
Proceedings of the Institute of Radio Engineers, 1949, 37, 
fd), SSS). 

(4) Rice, S. O.: ‘Mathematical Analysis of Random Noise’, 
Bell System Technical Journal, 1944, 23, p. 282. 

(5) Gray, F.: ‘Pulse Code Communications’, U.S. Patent 
No. 2632058, March, 1953. 

(6) Hammine, R. W.: ‘Error Detecting and Correcting Codes’, 
Bell System Technical Journal, 1950, 29, p. 147. 


DISCUSSION ON THE ABOVE MONOGRAPH 


Dr. D. A. Bell (communicated): The unequal significance of 
digits is the weakness of p.c.m., but its strength is the fact that 
it is quantized; an approach to channel capacity in the Shannon 
sense depends on the receiver being able to make a choice out 
of a finite collection of possible messages. A difficulty is that 
the Shannon formula C = f, log, (1 + P,/P,) refers to a rate 
of communication with negligible risk of error; which in the 
p.c.m. case would mean communication with ‘noise’ consisting 
only of quantizing noise. The communication rate for p.c.m. 
suggested by Oliver, Pierce and Shannon is based on the assump- 
tion of a finite but tolerable error-rate, so that it is not strictly 
comparable with the channel-capacity formula until a correction 
has been made for this accepted error-rate. 

It is interesting to compare Dr. Flood’s Table 4 with the 
theoretical minimum number of digits required for an error- 
correcting binary code to replace each single digit. If the code 
group is regarded as one information digit plus c check digits, 
the requirement is 2° > F, where F is the number of different 
kinds of error which may occur, inclusive of the no-error case. 
If there are just & interfering noise pulses amongst c + 1| digits, 
the number of possible faults (kinds of error) is 


(c+ DYkMc+1—28! 


But if the number of interfering pulses may be anything from 
0 to k inclusive, 
k (c + 1)! 


Sire PO is aN 


Agente SALON) 
Table A shows the values of F for values of c + 1 from 1 to 10 
and k from 1 to 6, together with the values of 2°. The terms in 
the summation in eqn. (A) are binomial coefficients, so that if k 
were equal to c +1 we should have F = (1 + 1)et! = 2¢+1. 
Since the c additional digits can cover only cases up to F = 2°, 
k must be less than c -+ 1 in such a way that Fis halved. This 
is possible if the binomial series has an even number of terms, 
ie. if c + 1 is odd and there is then a number k such that F = 2° 

exactly. This is why an odd number of pulses per digit is more 
satisfactory than an even number. 

The major difference between the author’s approach and 
Hamming’s is that the former uses a logical test on an arbitrary 
code, while the latter has constructed codes in such a way that 
tie location of error is indicated by a set of parity checks. In 

‘either case the additional digits must be sufficiently numerous 
to contain the exact information required to correct the error. 


Table A 


No. of pulses 
per digit, c + 1 


SOMDIADANRWNK 
—SKOODNDAMAARW 


_ 
— ee 


Table 3 shows for the case of two pulses per digit an error 
rate of 4+ if one pulse is mutilated by noise, and an error rate 
of + if both pulses are mutilated. An error rate of 4 in a binary 
system represents zero information, but in a realistic assessment 
one should presumably weight the occurrence of one and two 
noise pulses respectively according to the Poisson formula, or 
if p; is small one might put p, ~ p? and po ~ 1 — py — po. 
For example, with p; = 10% the system using two pulses per 
digit would produce 24% false digits due to single noise pulses 
and 4% due to pairs of noise pulses, making 3% overall error 
rate. 

The fact that the use of an error-correcting code group for 
individual digits does not improve the communication rate is 
consistent with the basic philosophy of communication theory. 
The rate for a given signalling power can always be increased 
by increasing the bandwidth, but only if the resulting signals are 
interpreted by a probability estimate of the whole, not by seeking 
certainty for each element of the code group. In the multi- 
dimensional geometric analogue, one must look at the resultant 
distance between different message points, not at the differences 
in individual co-ordinates which correspond with the individual 
elements of the code group. 

Dr. J. E. Flood (in reply): I am grateful to Dr. Bell for pro- 
viding a useful addition to Section 4 of the paper. His dis- 
cussion of an error-correcting code for a single information digit 
provides an interesting explanation of the results in Table 4. 
The paper points out that it is better to use an error-correcting 
code for the whole group of pulses rather than for each separate 
digit. Dr. Bell has correctly stated the reason for this. 
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SOME OPTIMUM FOUR-TERMINAL NETWORKS HAVING GIVEN INPUT AND 
OUTPUT SHUNT CAPACITANCES 


By O. P. D. CUTTERIDGE; M.Sc.(Eng.), Ph.D., Associate Member. 


(The paper was first received 28th May, and in revised form 16th September, 1957. It was published as an INSTITUTION MONOGRAPH in 
February, 1958.) 


SUMMARY 


The paper deals with the open-circuit transfer impedances of low-pass 
4-terminal networks having given parasitic capacitances shunted across 
both input and output terminals. The discussion, in the first instance, 
is in terms of the poles and zeros of the transfer function, although 
physical realizations of optimum networks are also derived. 

The first part of the paper deals with the adjustment of the driving- 
point impedances to produce a transfer impedance having a maximum 
d.c. gain consistent with given pole and zero positions; various cases 
having two and three poles are considered. In the remainder of the 
paper optimum pole-zero patterns yielding maximum-gain/rise-time 
quotients consistent with monotonic response are obtained. 


LIST OF SYMBOLS 


w = Angular frequency. 
p = jw = Complex frequency. 
Z11> 2125 221) 222 = Open-circuit impedance functions of a 4-ter- 
minal network (4-pole). 
A = Nodal determinant of a 4-pole system. 
Ay1, Ay2, Ay2Various cofactors of A. 
R23, 2217, R22 = Real parts of 214, 212 and 25. 
C,, Cy = Shunt capacitances at ends 1 and 2, respec- 
tively, of a 4-pole system. 
C = Shunt capacitance at either end of a sym- 
metrical 4-pole system. 
a, b = Parameters used in Sections 2.1, 2.2, and 2.3. 
k = Parameter used in Sections 2.1, 2.2 and 2.4. 
x = Parameter used in Sections 2.1 and 2.2. 
y = Parameter used in Section 2.2. 
k,, k, = Parameters used in Sections 2.3 and 2.5. 
X1, X27 = Parameters used in Sections 2.3, 2.4 and 2.5. 
aloe 
Zp = 21 + 242. 
a, 8, y = Parameters used in Sections 2.4 and 2.5. 
Tr = Rise-time using Elmore’s definition. 
@ = Figure of merit of network using Elmore’s 
rise-time definition. 

Dax = Maximum value of ® obtainable with 
monotonic response. 

f(t) = Transient response to a step-function drive, 
normalized to unity at infinite time. 

G = Direct-current gain of network. 

A, B = Real and either of the imaginary parts, 
respectively, of a pair of conjugate com- 
plex poles. 

JD) = SEY. 


(1) INTRODUCTION 


The methods used here are basically the same as those adopted 
by the author in a previous paper! on 2-terminal networks. Thus 
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no particular form of network is assumed at the outset and the 
discussion is conducted in terms of the poles and zeros of the 
network functions. The maximum obtainable d.c. gain of the 
open-circuit transfer impedance of a passive 4-terminal network 
is, for given poles and zeros, limited by the passive restrictions 
and the parasitic shunt capacitances. In the following Section 
maximum possible d.c. gains are derived for some networks 
containing up to three poles. 


(2) OPEN-CIRCUIT TRANSFER IMPEDANCES HAVING 
MAXIMUM POSSIBLE D.C. GAIN 

As is well known,? the external behaviour of a linear 4-terminal 
network (4-pole) is completely determined by four independent 
functions of frequency (or functions of p, the complex frequency), 
e.g. the open-circuit impedance functions Z;;, 212, Z2; and Zp. 
For a passive 4-pole composed exclusively of reciprocal elements, 
such as that being considered here, z; = Z). 
the 4-pole is symmetrical, z,; = Z22. The expressions for the 
open-circuit impedance functions in terms of the nodal deter- 
minant, A, of the 4-pole and its various cofactors are 


wAL 
Zs Way 
A 
249). ae aa aa. A ; (i) 
A 
op ae 


Thus, for a 4-pole composed of lumped linear elements, the 
z’s are ratios of polynomials in p with real coefficients, and 


stability considerations confine the poles of the z’s to the left-half — 


p-plane. The zeros of z,; and z. are also confined to the left- 
half p-plane, as are those of z,, in the case of a minimum-phase 


4-pole, but in general the zeros of z,. can lie anywhere in the. 


plane. 
The fact that the 4-pole is passive means that the following 


restrictions are placed upon the real parts of the z’s at real 
frequencies: 


Rz 4, = 0 
£2, 22x, — (Az, > 0 
R25 Ss 0 


(2) 


These restrictions were first derived by Vaulot? and later by 
Gewertz,4 who also gave a general synthesis procedure for a 
linear passive 4-pole. A much simpler derivation of these 
restrictions was given by Tellegen.> 

If a 4-pole has parasitic shunt capacitances across both ends, 
various additional restrictions are imposed on the open-circuit 
impedance functions. For the transfer impedance z,5, the 
number of zeros cannot be greater than one less than the number 
of poles, and for the driving-point functions Z4, and z5, the 
number of zeros must be equal to one less than the number of 
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If, in addition, — 
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poles; this is to ensure correct behaviour at infinite frequency. 
If the number of zeros of z;, is less than the number of poles by 
at least two,* then, at infinite frequency, Z1, and z>, tend to 
I/pC, and 1/pC, respectively, where C; and Cy are the shunt 
capacitances at ends 1 and 2, respectively,t and p = jw. By 
substituting these restrictions in eqns. (2), one can obtain an 
upper limit to the d.c. gain of z,;,. The exact form for the 
optimum 2), functions will depend on the particular pole-zero 
combinations, and some of the simpler cases are now considered. 


(2.1) Symmetrical 4-Pole with Transfer Impedance 2,2 having Two 
Poles and no Zeros 


Let the parasitic shunt capacitance at both ends of the 4-pole 
be C. The open-circuit impedance functions can be written 


EU | 
(3) 
Z4(p) = : Estas 
ee C p*+2ap+b 
where a >0,b>0,x>0. 
; k(b — w?) i 
Hence &2z,,(jw) = (Te ORY es 
2p 
Bat in tioy 1 xb + w*(2a -- x) 


C (6 — w?)? + 4a2w? 

Substituting eqns. (4) into the passive restrictions (2), we have 
20a. Ie ee ee () 

[xb + wa — x) P — C2k?[b — w* 2? > 0 

‘which finally reduces to 

(x? — C?k*)(b*) + 2(w*b)[x(2a — x) 4+ C7k?] 

+ w*[Qa—xyr—CkK]>0 © 


For condition (6) to be satisfied by all w? > 0, we must have 


and 


x* > Ck? (7) 
(2a — x)? > Ck 
Thus 
(CVA) 3 = @? } (8) 
given by x=a 
a 
'o oe 
giving (Z12)opt a C(p? oc 2ap a b) Hs 


; he (p + a) 
with corresponding 21; | 


1 
C (p? + 2ap + db) 


1/2aC 


Big. 1.—Lattice realization of optimum network for symmetrical 4-pole 
with transfer impedance z12 having two poles and no zeros. 


* If the number of zeros of z12 is one less than the number of poles, the position is 


_ _sifferent; such a case is considered in Section 2.3. 


+ C; and C2 will have this significance throughout the paper; C will be written for 
2, and C2 in the case of a symmetrical 4-pole. 
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W2ac 


1/2aC 
Vac 


(4a2b)/4abC 


(@) (b) 

Fig. 2.—Unbalanced networks equivalent to lattice network of Fig. 1. 

(a) Bridged-T network. 

(5) m-network, realizable for 4a2 > b. 
The corresponding physical network can be realized as a sym- 
metrical! lattice in the usual manner;° the realization for z,> with 
the positive sign is shown in Fig. 1. Fig. 2 shows two unbalanced 
networks developed from the lattice of Fig. 1. 


(2.2) Asymmetrical 4-Pole with Transfer Impedance z,, having 
Two Poles and no Zeros 


Let C; and C, be the parasitic shunt capacitances across ends 1 
and 2 of the 4-pole. The open-circuit impedance functions then 
take the following forms: 


Z12(p) = 
12\P p2 + 2ap +b 
1 pt+x 
Z14(p) GC P- aE 2ap EC b (10) 
Dip) ee 
es Cy p? + 2ap +b 
Whereas 05D ie= Oh Om yi (: 
WS Kb Ea) 
Hence Rz, (jw) a oe a 
.., 1 xb+a? Qa—x) 
Rz\(jw) Gs (b =. w)2 4 4a2w2 ad 1) 
eee eee Cede) 
and Rz7(jw) Chih aon eee 
Thus £#z,, > 0 and &z,, > 0 imply, respectively, 
20 
Me (12) 
and 2Za>Sy) 


and the restriction #z,;%z.. — (#z1)* > 0 yields 
b*(xy — CyCyk*) + bw? [x(2a — y) + y(2a — x) 

+2C,C,k?] + w4[(2a— x)(2a— y) —C,C,]>0 (13) 
For condition (13) to be satisfied for all w* > 0, we must have 


xy => C,Cyk? (14) 
(2a — x)(2a — y) > CiC)k? 
From conditions (14) it is easily seen that 
(Ci GIP ax =a (15) 
given by eye 
Thus 
(212)opr = 4 = 
“12ont  * (p+ 2ap + BV (Cis) 
with corresponding 
ape 1 pa (16) 
1 C; p? + 2ap +b 
1 +a 
and Zz P 


22" Cy p? + Zap +5 
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One possible physical realization of the network corresponding 
to eqns. (16) is easily obtained by examining the form of the 
z-matrix of a 4-pole preceded by an ideal transformer (see, for 
example, Reference 2). It is then seen that, by preceding one 
of the optimum 4-poles derived in Section 2.1 by an ideal trans- 
former of turns ratio 1 : +/(C,;/C>) [or, equally well, by following 
the 4-pole by an ideal transformer of ratio 1 A/(C31Gp) a 
realization corresponding to eqns. (16) is obtained. 


(2.3) Symmetrical 4-Pole with Transfer Impedance 2, having 
Two Poles and One Zero 


The open-circuit impedance functions can be written: 


“ _ _ kyptk 
712(P) p? + 2ap +b 
(17) 
ol ar 
and Z14(D) = pe ae 2ap Tay 


where a> 0,5>0,x, > 0, x,>0. 

Owing to the form of the transfer function in this case, the 
driving-point function no longer tends to 1/pC at infinity, where 
C is the parasitic shunt capacitance at each end of the 4-pole. 
The effect of the shunt capacitance can be introduced as follows: 


(x2 — ky)p + 1 — Ay) ] 


= = % ss 
SG ace) p? + 2ap +b (18) 
(x2 + ky)p + (x + ky) 
d = Be, > =a — 
an 2141 + 212 b> Say Pen DapeeD 


Hence, at infinite frequency, Z, becomes a capacitance of 
1/(x. — ky) and Z, becomes a capacitance of 1/(x, + k,). 
Thus, equating the smaller of these capacitances to C and making 
the larger capacitance positive, 


and ao Ome | ved 
or oe re 0 
and x2 +k,>0 eS = 
The real parts of z,;,; and Zz, are easily obtained as 
Bl kan w?(2ax, — x4) ] 
As) (b — w?)? + 4a%w2 
21 
bk, + w*(2ak, — ky) 2D) 


and Rz,>(jw) SS 


(b — w?)? + 4a2w?2 
Thus Zz,, > 0 and (#z,;)? — (Zz,2)? > 0 imply, respectively, 


2ax> — x1 0 (22) 


and 
b°(xt — k4) + 2bw? [x (2axy — x1) — kak, — k,)] 
+ w4[(2ax, — x1)? — Qak, — k,]>0 (23) 
For condition (23) to be satisfied it is necessary that 


xj > ke (24) 


and (2ax, — X4)* = (Qak, = k,) 5 (25) 


On substituting conditions (19) and (20) in (25) and optimizing 
k, in accordance with (22) and (24), the optimum z,, is finally 
given by 

4 1 (k,C)p +a 
~ C p? + 2ap +b 


(Z;2)opt = (26) 
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with corresponding 


1 “G0 pera 
= 2 

Z11 C p? + 2ap + b a 
where 0<k,C <1/2 (28) 
and 1>x,C > 1/2 

= | 

and such that x2C + kaC (29) 
and XC > koC 


Eqn. (26) was obtained on the basis of k; and k, having the same 
sign. If k,; and k> are of opposite sign, the optimum value of 
k, is found to occur when ky = 0, which means eliminating the 
zero of z,> initially assumed to be present. This case then 
reduces to that considered in Section 2.1, with identical results. 

Inserting eqns. (26) and (27) in (18), and taking the positive 
sign in eqn. (26) for concreteness,* we have 


1 (XC = k,C)p 


it~ in Aa Cea aD (30) 
1 p+2a 2 
and Z11 } Z12 Li Cp? a 2ap gis i 5 (31) 
: 1 Pp 2a b 1 
Since = -- 52 
Za *a—ky -%, ky %%.—kyp a 
and ! : 


Sess { 
Fag be ESTES VEC 


one possible physical realization of the 4-pole is as shown in 
Fig. 3. 


(xp-kp)/2a (xp-ko)/2a 


(x p-kp)/b (xo-kp)/b 


11(xp-kp) 


Fig. 3.—Lattice realization of optimum network for symmetrical 4-pole 
with transfer impedance z;2 having two poles and one zero. 


(2.4) Symmetrical 4-Pole with Transfer Impedance Z4, having 
Three Poles and no Zeros 


The open-circuit impedance functions can be written: 


k 


212(p) = 
) 12.) p> +op? + ppty 
> (34) 
and Zi 
C pup" + pp +¥ 
where Xx, > 0, x, >0,«>0,8>0, y>0 
and, in addition, CB ayaa (35) 


which is obtained by applying the Hurwitz stability criteria to 


the cubic polynomial, whose zeros must be confined to the left 
half-plane. 


* Th : hr ‘ ; 
aa Gu, negative sign in eqn. (26) would only interchange Zg and Z> in eqns. (30) 


(33) 9f 
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oo. a a4 k(y = aw?) 
12 (y — aw)? + w@(B — w)? 
aa | (36) 
Az, (jor) —L 117 + 0B = max— y) boa — xy) 
11 G (y = aw)? ae w>(B = ww)? 


Substituting eqns. (36) into the passive restrictions (2), we have 


xy + w(x28 


xa—yY+ute—x)>0 (7) 


and 
[xy + w(x>8 Ges — y) + w*(a — X,)]? 

— C7k*(y — aw’)? >0 (38) 
Now, for k > 0, condition (38) reduces to 


Xyy + w(x8 — xy — y) + w(x — xy) — Ck(y — aw) > 0 
for 0O<awt<y (39) 
and 

Xyy + w(x2B — xy — y) + wa — x2) + Ck(y — aw’) > 0 
for y < aw? < 00 (40) 


Some immediate consequences from conditions (37)-(40) are 
(41) 
(42) 


Ci = 8%) > U 
2%) = Ce SV 
Assuming* that (Ck), is obtained when x, = Ck, and sub- 
stituting this in condition (40), 
2yCk + w(x.8 — y — 2«Ck) + w*(a — x2) > 0 


y< aw? << © (43) 


for 


The limiting value of Ck from (43) is obtained when the left-hand 
side is a perfect square, the condition for which is 


(x28 — y — 2aCk)? = 8yCk(a — x») (44) 

If both sides of eqn. (44) are differentiated with respect to x, and 

d(Ck)/dx, is put equal to zero, (Ck) mq is finally obtained as 
KOK )ezy ee 46 : 


Be 


given by X B 


(45) 


(46) 


It will be found that these values satisfy (37) and that (38) 
reduces to 


(y/B2)w2(w? — BY [yw + BloB — 2y)] > 0 


Thus (Ck) max Will be given by eqn. (45) only provided that 
aB > 2y, since otherwise (47) will not be satisfied at very low 


frequencies. 
B/2 


(47) 


Hence = (212) opr = + CGR ER ED Ey) 
i i 48) 
th ondin ( 
with corresp g p> + («8 — y)plB + B/2 | 
“Al ~~“ C(p3 + ap? + Bp + 
provided that aB > 2y (49) 


A possible lattice realization of the corresponding 4-pole is shown 
i. Fig. 4 for z,> with positive sign. 


* It can easily be verified that this does, in fact, yield the maximum value of Ck. 


(6) z: — ily 
Blyc 
SG 


Fig. 4.—Lattice realization of optimum network for symmetrical 4-pole 
with transfer impedance z;2 having three poles and no zeros. 


(a) For $3(a8 — 2y) > y(@R — y)2 


L B2(aB — y) 
SCE 2y)'— 7CCs)2 
Ry B(aB — y)? 


~ BC(aB — 2y) — yC(aB — y)2 


(6) For R3(a8 — 2y) < y(@B — y)2 
SxGRi= 2 RR 2y) 
B2y~C(aB — 2y) 
z y?C(aB — y) 
y@e — ¥)? — Be — 2y) 


Ry 


Cp = 


(2.5) Symmetrical 4-Pole with Transfer Impedance z,, having 
Three Poles and One Zero 


The open-circuit impedance functions are here 


ig Oya 
i p> +op?+ Bot+y a 
1 274 xp+x 
and Z4(p) = Z be - 


C pi + ap? + bp +y 


where, as in Section 2.4, a >.0, 8 > 0). 0,74 = 0 0 
and «8 > y. Further, in the first instance, take k; > 0. Then 
(2z,1)* — (#z,)* > 0 finally reduces to 


[xy + @7@oB — xx — y) + wa — x2)P 
— Clkyy + w(koB— kyo) — kyw4*P > 0 (51) 
Proceeding as in Section 2.4, we finally obtain 
(Ck) max = 48 
aB — y 
B : 


Substituting these values in condition (51) yields 


(y/B\@? — BY [w?e28 — CkB — y) 
+ w(x — x2 + Cky)]>0 (54) 


Thus (Ck1)max Will be given by eqn. (52) only provided that 
x28 — Ck.B> y 
x28 — CkoB < a8 


otherwise (54) will not be satisfied at very low and very high 
frequencies. Eqn. (53) and conditions (55) and (56) together 


(52) 


given by 


(55) 


and (56) 
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determine the following ranges of values of x. and Ck, consistent 
with Ck, = 48: 


a 
oe Br > 5 (57) 
— 2 
ad x 8 a a8 — 2y (58) 


Thus k, will be negative (indicating a non-minimum-phase 
transfer impedance) over the whole range indicated by condi- 
tion (58) unless «6 > 2y. 

It will be found that the pair of values x, = 4a and 
Ck, = («8 — 2y)/28 satisfy the equation #z,,> 0 provided 
that the Hurwitz condition, ap > y, is satisfied. Other pairs of 
values of x, and Ck, from conditions (57) and (58) will satisfy 
&z,;> 0 only if an additional restriction exists on the values 
of «, B and y. 

For example, it is easily verified that 


y 5 deel 
ye [Ae 
X2 a 2p” G 2 2B | 
59 
are possible only provided that oo 
Sy > oB > y J 
and Pe ae a gf. Ck, ==() | 
are possible only provided that | (60) 
a8 > 2y 
which latter result checks with that obtained in Section 2.4. 
To summarize: 
,= + 
Crrdoot = = GIF ap? + Bp +) 
with corresponding (61) 


pt + xp + B/2 
C(p? + ap? + Bp + y) 

where Ck, lies within the range given by (58) and the correspond- 
ing value of x, is obtained from eqn. (53). In general, an addi- 
tional restriction exists on the values of «, 6 and y, its exact 
nature depending on the value of Ck, selected. Two examples 
of this are given in eqns. (59) and (60) and no extra restriction 
is, in fact, necessary when Ck, has its maximum possible value 


of (a8 — 2y)/28. 


(3) MAXIMUM-GAIN/RISE-TIME QUOTIENTS CONSISTENT 
WITH MONOTONIC RESPONSE 

In this Section, various transfer-impedance functions having 
maximum gain, derived in Section 2, are taken in turn and their 
poles and zeros adjusted so as to produce a monotonically 
increasing output having minimum rise-time when a step-function 
input is applied. 

In order to obtain a measure of the relative efficiencies of net- 


works in this respect, it is convenient to define the figure of merit 
of a network as 


Z11 


Gain/rise-time of network 
Gain/rise-time of parallel RC network* 


Its value for a particular network will depend somewhat on the 
definition of rise-time adopted. Two definitions will be used 
here; one is the time taken for the response to a step-function 
drive to rise from 10% to 90% of its final value, and the other is 
Elmore’s® definition. The latter is applicable only in the case of 
monotonic response but has the advantage of being easily derived 
from the Laplace transform of the output. Thus, if F(p) is the 


* The shunt capacitance of this network should be equal to the sum of the input 
and output shunt capacitances of the 4-pole system under consideration. 


CUTTERIDGE: SOME OPTIMUM FOUR-TERMINAL NETWORKS HAVING 


p-multiplied Laplace transform of the response, normalized to 
unity at infinite time, to a step-function drive 


1+ ap + agp? +... + 4mnp™ 


= (62) 
ene FQ) = 75) ps es 
then Tp, Elmore’s definition of rise-time, is given by 

T2 

or = b? — a? + 2a, — by) . (63) 


(3.1) Maximum Figure of Merit Obtainable with Two Poles and 
no Zeros 


The optimum function z, is given in eqn. (16) as 
a 
Cidoo = + GF Dap + WC 
For this function, Tp is easily shown to be given by 
Temes 2(2a? — b) 
27 ~2Sté«é 


and hence, if ® is the figure of merit using Elmore’s rise-time 
definition, 


(64) 


oe 4(C, + Cy) ( 2a” 
VC) V\2a —b 
The maximum possible value of b consistent with monotonic 
response is b = a*, which corresponds to the critically damped 
condition of two coincident real poles. Writing ®,,,. for the 
maximum © obtainable with monotonic response, we have 


_ Cy + G) 
Pmax = HCC) V7 


which reduces to 4/2 for equal input and output capacitances. 
If these capacitances are unequal, the figure of merit would be 


(65) 


(66) 


somewhat greater, e.g. 6% and 15% larger for capacitance ratios — 


of 2: 1 and 3 : 1, respectively. 
If f,(t) is the transient response normalized to unity at infinite 
time in the critically damped case, then 


f,(t) = 1 — [1 + GV(C,C,)] exp [— 1GV(C,C.)] (7) 
where G is the d.c. gain,* from which the 10%-90% rise- 
time is found to be 3-36 G/(C,C,), corresponding to a 
figure of merit, using the 10%-90% rise-time definitions, of 


0-654 (Cy + C)/\/(C,C,) or 1-31 for equal input and output 
capacitances. 


(3.2) Maximum Value of Figure of Merit Obtainable with Two 
Poles and One Zero 


The optimum relations are derived in eqns. (26) and (28), 


namely 
1 (k,O)p +a 


— + —_— 
(212)opt = © p+ lap +b 


withO< k,C <0°5. 
Eqn. (26) gives 


Te is 4aq* (k,C)? yy 
or ee b (68) 
and hence 
9) os 
® = (69) 


~ 4/[4at — (KCB? — 20°] 


Inserting the maximum value of b consistent with monotonic 
response (=a?) and the maximum of k,C(=0°5) in eqn. (69), 
we have 


Dox = y= 1°51 : 


* G has this significance throughout the remainder of the paper. 


(70) 


Sa 


Saar ea 


TS a 
PS 


Sap 


Sees 


SFE ee = 


a 
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By taking 5C slightly less than 0-5 a fi it sli 
gure of merit slightly 
less than Al/ 7 could be obtained without degeneracy in the 
corresponding network, realized in Fig. 3. 


The normalized transient response in the limiting case, f,,(t), is 
given by 


f(t) = 1 — (1 + f2GC)_e-116C (71) 


from which the 10%-90 & rise-time and the 10%-90% figure of 
merit are found to be 3-07GC and 1 43, respectively. 


(3.3) Maximum Value of Figure of Merit Obtainable with Three 
Poles and no Zeros 


Here the appropriate maximum gain function, given by 
eqn. (48), is : 
B/2 


+ 
~ C(p3 + ap? + Bp + y) 


(Z12)opt eam 


provided that «8 > 2y. 
For this function, 
TR  f? — day 
Qa y? 


(72) 
and hence 
1 


VC — 2oey/B?) 
and the maximum value of 2 /6* consistent with monotonic 
response will yield ®,,,,... 

Consider first the case of z;, having three real poles; this will 
always result in monotonic response. The function 2«y/A? will 
have its maximum value, 4%, when the three real poles are 


coincident. 
Thus O72. = 4/3 


for three real poles. . 

The other case to consider is that in which z,, has one real 
pole and the other two are conjugate complex. The limiting 
condition for monotonic response then is that the real parts of 
the poles should be equal, giving 


ope) et B?/A?) 
p> (3 + B2/A22 
where A is the real part of the poles and B is the imaginary part 


of either of the conjugate complex pair. The maximum of the 
function 2a |B? is now #, given by B = A, and hence 


@ 


(73) 


(74) 


(75) 


Dnax = 2 (76) 
for one pole real and two conjugate complex. 
The corresponding normalized transient responses are 
f,(t) = 1 — e—3426CT] + 3t/2GC + 4(3t/2GC)7] . (77) 
and f,(t) = 1 — e~"@C[2 — cos (t/GC) + sin (t/GOQ)] . (78) 


from which the 10% — 90% rise-times are, respectively, 2°81GC 
and 2:51GC, yielding corresponding figures of merit, calculated 
using the 10% — 90% rise-time definition, of 1-56 and 1-75, 
respectively. 

It is of interest to consider whether the value of B/A giving 
the maximum figure of merit calculated on an Elmore rise-time 
basis also yields the optimum figure of merit using 10% — 90% 


rise-time. For general values of B/A the normalized transient 
response to a step-function drive, f,(t), is given by 

1 3D ot 
Bo 1 — pice? ( (De ee 


34D? Dt . (3+D? Dt 

2 Ea eas 

la + D’) cos (; +p ae) pln G + D2 26C 
where D = B/A. 


| (79) 
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From this the 10% — 90% rise-time and corresponding figure 
of merit can be calculated for various values of B/A. Fig. 5 shows 
the variation in figure of merit over the range 1 < B/A < 10; the 
maximum is 1-96, given by B/A = 2-1. This represents an 11% 


FIGURE OF MERIT 


BIA 


NORMALIZED RESPONSE 


35 


2:0 25 


1 3-0 
t/2GC 
Fig. 6.—Two transient responses with three poles and no zeros. 


(a) Coincident real poles. 
(b) Collinear complex poles, B/A = 2. 


increase over the value obtained by assuming that the value of 
B/A which optimized the figure-of-merit on the basis of Elmore’s 
rise-time also optimized that calculated on the 10% — 90% rise- 
time definition. Fig. 6 compares the transient response for 
coincident real poles with that for B/A = 2. 
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February, 1958.) 


SUMMARY 


The paper considers a high-voltage impulse-breakdown test pro- 
cedure consisting of a series of impulses of increasing magnitude 
leading to breakdown. The results expected from such a test are 
compared with the variation with voltage of the probability of break- 
down for a single pulse. 


LIST OF PRINCIPAL SYMBOLS 


P(V) = Probability of breakdown occurring as the result of the 
application of a single impulse of peak value V. 

V, = Mean breakdown voltage, ie. that voltage for which 
ROA Ono 

S,; = Standard deviation of the distribution of P(V) with 
voltage. 

Q(V) = Probability of breakdown occurring for the first time 
as the result of the application of the last impulse 
of a series of impulses increasing in regular steps to 
a voltage V. 

V, =Incremental voitage difference between successive 
impulses in an increasing series. 

V, = Estimate of the mean breakdown voltage obtained from 
tests with series of increasing impulses. 

S, = Standard deviation of the distribution of Q(V) with 
voltage. 


(1) INTRODUCTION 

When a high-voltage impulse is applied to an insulating gap 
there is a certain probability that failure, or breakdown, of the 
dielectric will occur. This probability increases from zero to 
unity as the voltage is raised, and the mean breakdown voltage 
of the gap is defined as that voltage for which the probability of 
breakdown is one-half. If the mean breakdown voltage is to be 
measured experimentally, it is sometimes convenient to use a 
test procedure in which series of impulses of a given waveform 
and of regularly increasing magnitude are applied to the gap 
until failure occurs, since in this way the maximum information 
may be obtained from a limited number of breakdowns. Tests 
on the impulse breakdown strength of transformer oil or other 
liquid dielectrics are often made in this way. The results 
obtained with such a procedure, however, can be shown to be 
dependent on the value chosen for the incremental voltage 
difference between successive impulses in the series, and thus must 
be interpreted with care.* The paper considers the relationship 
between the results to be expected from experiments of this type 
and the variation with voltage of the probability of breakdown 
occurring as the result of the application of a single impulse. 


(2) GENERAL ANALYSIS 


The probability, P(V), that breakdown of a gap will occur as 
the result of the application of a single impulse of peak value V 


* Lewis, T. J.: ‘The Statistical Basis of Impulse Testing’, Proceedings I.E.E. 
Monograph No. 249 M, July, 1957 (105 C, p. 27). . J , 
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Fig. 1.—Variation of the probability of breakdown with voltage. 


(a) For a single impulse. 
(6) For a series of increasing impulses. 


has a distribution similar to that shown in Fig. 1(a), and the 
mean breakdown voltage, V;, is defined as the voltage for which 
P(V) = 0-5. However, when a series of impulses of increasing 
magnitude are applied to a gap, the probability that breakdown 
will occur for the first time at a particular voltage level is depen- 
dent on whether or not breakdown occurs at the lower levels. 
Thus the probability, Q(V), of breakdown occurring for the first 
time at a voltage V under these conditions is equal to the 
probability of breakdown occurring as the result of the applica- 
tion of a single impulse of magnitude V multiplied by the product 
of the probabilities of breakdown not occurring for each of the 
preceding impulses. Thus if the peak values of successive 
impulses in the series are increased in regular steps equal to V; 
from an initial value V,, the probability that the first breakdown 
will occur as the result of the application of the Nth impulse is 


N—2 


QY) = PCY) TI. fee Aye 


a 


where V=V,+WN—1)J, tke ce ee 
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Since P(V) varies from zero to unity as the voltage is increased, 
1 — P(V) must fall to zero for large voltages, and Q(V) as 
defined by eqn. (1) must tend to zero for voltages which are 
considerably below or above the mean breakdown voltage. If 
Q(V) is evaluated for any given incremental voltage V, and a 
range of initial voltages 0 << V, < V,, a continuous distribution 
is obtained similar to that shown in Fig. 1(b). This curve 
represents the results that would be obtained from an infinite 
series of tests under these conditions using series of increasing 
impulses leading to a breakdown. The mean voltage of the 
distribution, V3, is the estimate of the mean breakdown voltage 
obtained in this manner, and does not necessarily coincide with 
V,; it is given by 


| Q(V)Vav 
V, = SS . . . ° . (3) 


J ‘av 


The variance (square of the standard deviation S,) of the 
distribution is given by 


| anw — nyav 
Sz = — (4) 
i Q(V)dv 
0 
The relationship between V,, V2 and V, is given by eqns. (1)-(3), 
and may be evaluated for any given distribution of P(V). 


(3) RESULTS WITH P(V) NORMALLY DISTRIBUTED 
The variation of P(V) with voltage obtained in breakdown 
tests often approximates to the normal distribution function, or 
integrated Gaussian distribution, given by 


i (v 


1 2 
ROA Sa/ Gn) j exp —-~~—_ dv. . @) 


=I 


LOG ig Q(V) 
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ie) 
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Fig. 2.—Variation of Q(V) with voltage. 
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Fig. 3.—Difference in means of the distributions of Q(V) and P(V). 
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Fig. 4.—Ratio of the variances of the distributions of Q(V) and P(V). 


provided that P(V) = 1 when V + ©, where V, is the mean 
breakdown voltage as previously defined and S, is the standard 
deviation of the distribution. 

With this distribution of P(V) assumed, several plots of Q(V) 
have been computed from eqn. (1) using published values for the 
normal distribution function.* Some of these are shown in 
Fig. 2, from which it is possible to obtain the relationship between 


* LINDLEY, D. V., and Miter, J. C. P.: ‘Cambridge Elementary Statistical Tables’ 
(University Press, Cambridge, 1953). 
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V, and V, in terms of either S, or S. In Fig. 3 the difference 
between V, and V, is plotted against the incremental voltage, 
V,, where both are expressed in terms of S;. In Fig. 4 the ratio 
of the variances of the distributions of Q(V) and P(V) are plotted 
for various incremental voltages. 

These curves have been calculated on the assumption that the 
values of the initial impulse, V,, in each series are distributed 
uniformly in the rangeO << V,< V,. If the initial voltage is not 
controlled in any test, the results obtained may differ from those 
plotted in Figs. 3 and 4, although the difference will not exceed 
a few per cent over the range of incremental voltages given by 
V,<.S>, provided that V;—V,>4S;. In any particular experi- 
ment an error may also arise from the limited number of results 
obtained; furthermore, the theory does not take into account the 
scatter introduced by inhomogeneities in the test specimens. 

The results summarized in Fig. 3 show that the estimate of the 
mean breakdown voltage, V>, may differ considerably from the 
mean breakdown voltage V,;. If, for example, in a test which 
consisted of the application of series of increasing impulses, the 
voltage was raised in steps equal to 2% of the estimate obtained 
for the mean breakdown voltage and the standard deviation of 
the results was 10% of this voltage, the estimate of the mean 
breakdown voltage would be 18 % below the voltage which would 
give a probability of breakdown for a single impulse of one- 
half. The standard deviation of the probability of breakdown 
for a single impulse would be 16% in this example, which is 
60% greater than the value given by the experiment. 


(4) CONCLUSION 


It is seen that the estimate of the mean breakdown voltage, 
V>, obtained by means of a test procedure employing series of 
increasing impulses may differ from the mean breakdown voltage 
defined in Section 1. Furthermore, the variance of the results 
obtained in such tests is seen to differ from the variance of the 
distribution of the probability of breakdown with a single 
impulse. It is therefore obvious that the results from impulse 
breakdown tests of this type must be interpreted with care. 
Moreover, when comparing the results of tests it may be desirable 
to take into account the effect of the test procedure. The 
relationship between these parameters has been calculated for 
the case where the probability of breakdown with a single impulse 
follows the integrated Gaussian distribution. 

It should be noted that, when the incremental voltage is 
approximately equal to the standard deviation of the results 
obtained, the differences between V, and V;, and between S, 
and S$), are small. If, therefore, the incremental voltage is not 
predetermined by other considerations, the best estimate of the 
mean breakdown voltage will be obtained by the use of impulses 
increasing in steps equal to the expected standard deviation of 
the results. 
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THE ECONOMIC LOADING OF TRANSMISSION SYSTEMS 
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SUMMARY 


In several publications Kron, Kirchmayer and others have out- 
lined methods for transforming power-system operating data in terms 
of complex voltages and currents into information concerning real 
generator powers and transmission losses. These losses are related to 
the individual generator loadings by a set of constants which are 
dependent only upon operating conditions at normal load. 

In the present paper Kron’s methods for establishing a transmission 
loss equation for a hypothetical 3-generator, 2-load system are described 
and the analysis is extended for application to an actual section of the 
British network. 

The coefficients of the loss equation are obtained from the basic 
impedance matrix of the system and various transformation matrices, 
which transform the basic matrix to the final loss matrix, based on 
certain operating assumptions. 

A method is developed for combining the loss formula with station 
fuel costs for economic system operation. The resulting loading 
equations are illustrated in nomograph form as an aid to system load 
dispatching. 

In the paper, owing to certain limitations of the required information 
regarding system loads, certain of the load-flow conditions have been 
estimated by calculation, and the accuracy of the loss analysis is 
correspondingly limited. However, the purpose of the paper is to 

. illustrate available methods and not to formulate an accurate set of 
loading equations. 


LIST OF PRINCIPAL SYMBOLS 


FE!, 22, etc. = Load currents. 
TIE = Total system (or hypothetical) load current. 
1,, 1, etc. = Ratios of individual to hypothetical load 
currents. 
Z11, Z>, etc. = Impedance matrices for reference frames 1, 
DMCC: 
C}, C3, etc. = Transformation matrices between quantities in 
reference frames 1 and 2, 2 and 3, etc. 
V,, V>, etc. = Terminal voltages of generators le P25 ie 
Vis, Vip, etc. = Voltages at loads Lj, Ly, etc. 
Zn.x = Measured leakage impedance of system net- 
work between generator or load x and 
generator or load K. 
Zo.G = A matrix of measured network self impedances 
seen from generating points of entry. 
Zz, 1, = A matrix of measured network self impedances 
seen from load points of entry. 
Z1.¢ = Measured mutual impedances with generator 
points energized. ; ; 
Zo. = Measured mutual impedances with load points 
energized. 
I, [2, etc. = Generator currents. 
Zn—K = Complex impedance components for impedance 
matrix 233. 
6,, 85, etc. = Angles by which generator terminal voltages 
V,, V2, etc., are referred to a common axis. 


Correspondence on Monographs is invited for consideration with a view to 


PO Nicholson is with Lever Bros., Port Sunlight, Ltd., and Mr. Lynn is in the 


Electrical Engineering Department, University of Liverpool. 


Via. Vg, = Direct and quadrature components respectively 
of voltage vector V4. 
iK, [’K — Direct and quadrature components respectively 
of current vector J. 
J4¢K — Components of current vector J. 
A, = Ratio of reactive to active power for generator 
n at normal load. 
|Viflo = Terminal voltage of generator n at normal load. 
M,x = General term of loss matrix Z¢. 
Bix = General term of loss matrix using simplified 
analysis. 
Rn_x = Real components of impedance matrix Z33 used 
in final loss matrix. 
L,, = Incremental transmission loss at generator n. 
Px, Ox = Active and reactive power supplied by gene- 
rator K. 
P; = Total system transmission losses 


=) > Pane ee 


n K 
S, = Cost of fuel input to station n, £/hour. 
A = Incremental cost of received power, £/MWh. 


(1) INTRODUCTION 


For optimum operating efficiency of large power systems it is 
necessary to co-ordinate generation on an equal incremental fuel- 
cost basis with the incremental cost of the transmission line 
losses. Extensive research has taken place in America into 
methods of combining these costs by the application of transmis- 
sion-loss formulae. These express the transmission-line losses 
as functions of the generator and interconnector power and a 
set of constants. The loss equations, once obtained, are 
applicable for any condition of generation and load, if the 
simplifying assumptions made in deriving the equations are valid 
for all system conditions. 

Following the Steinberg and Smith! publication in 1943 on 
the economic loading of power plants, a method for expressing 
total transmission losses in terms of generator power was 
pioneered by George? in the same year. Ward, Eaton and 
Hale* extended George’s original methods, and provided a more 
generalized and analytical approach to the derivation of a loss 
formula, by use of the a.c. network analyser. George, Page 
and Ward? co-ordinated transmission losses and fuel costs by 
the application of a loss equation containing constants derived 
from a network analyser study. 

Kron,> in 1951, derived a loss-equation using tensorial 
methods, requiring considerably fewer measurements and cal- 
culations. Kirchmayer and Stagg, using Kron’s methods, 
obtained a transmission-loss formula for the American Gas and 
Electric system and investigated the effects of Kron’s simplifying 
assumptions on the accuracy of the loss formula. Kron,’ in a 
second paper, considered the existence of off-nominal turn 
ratios and their representation on an a.c. network analyser with 
auto-transformers. He followed this by two further publica- 
tions,!°.'3 in which he considered the study and co-ordination 
of several interconnected transmission systems, by combining the 
solutions of small components by a series of transformations. 
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Imburgia, Kirchmayer, and Stagg!® have described a computer 
for use in system load dispatching. The computer calculates, 
from the loss constants of a system, transmission loss penalty 
factors which are used in conjunction with an incremental fuel 
cost slide-rule for obtaining economic balance between generating 
stations. Operation of the slide-rule and the computer provides 
a method of combining transmission losses and generation costs, 
and of applying them to the loading of a system under rapidly 
changing conditions. 

In Part 1 of Kron’s work, six basic reference frames are 
established for solving steady-state power-system problems, and, 
in particular, for determining total and incremental transmission 
losses. 

The transmission-loss formula to be derived, as in other 
methods, involves the generated power of all sources and a set 
of constants. These constants (self- and mutual-impedances) 
once established are suitable for use under any operating condi- 
tions, within the limits of the operating assumptions, unless a 
physical change in the system takes place. The constants are 
obtained from a.c. network-analyser data or by analytical 
methods. The use of tensor algebra provides a method of 
transforming operating data (complex voltages and currents) into 
information concerning real powers, losses and a set of constants 
by means of a series of operations called ‘transformations of 
reference frames’. 

The object of the present paper is to relate the performance of 
power systems to a set of linear equations, containing real 
generator powers P and incremental losses Z incurred in the 
transmission system. These equations are of the form L = M. P, 
where M in matrix form represents a set of real constants which 
are dependent only upon operating conditions at normal load. 


(2) STUDY OF KRON’S ANALYSIS 
(2.1) System Operating Assumptions 


The methods outlined by Kron for determining a transmission- 
loss formula involve certain fundamental concepts of tensor 
analysis, and a number of assumptions concerning the operation 
of a power system. These assumptions are as follows: 


(a) The ratio of each load current to the total load current 
of a system, at normal load, remains constant as the loads vary. 

(b) The generator currents remain fixed in phase angle relative 
to each other as the generator loads vary. 

(c) The generator voltage magnitudes remain constant. 

(d) The ratio of reactive power to active power of each source 
remains constant. 


(2.2) Reference Frames 


This present power system study will involve the introduction 
of the following reference frames: 


1. Measurement of leakage impedances Z,,. 
2. Introduction of one hypothetical load J which replaces all 


n 
system load currents, where JE = Sy JZ», 
1 


3. Elimination of J“, leaving generator currents only. 

4. Generator currents and voltages are transformed into axes, 
in phase and in quadrature with the respective generator voltages. 
Matrices of complex quantities are changed into larger matrices 
containing only real quantities. 

5. Using the assumption of constant ratio of generator 
active/reactive power, each generator current is replaced by its 
projection upon the terminal voltage existing at normal load. 

6. Active components of generator currents are transformed 
into generator powers. Since voltage differences have been used 
in the previous reference frames, the transformation matrix in 
frame 6 yields generator power loss. 
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(2.2.1) Reference Frame 1. 


Consider a hypothetical power system consisting of generators 


G,, G2, G3 supplying loads Ly, L, via a transmission network 
as in Fig. 1. 


GENERATORS LOADS 


Fig. 1.—Hypothetical 3-generator 2-load power system. 


The self and mutual leakage impedances of the system network 


are measured, usually from a network-analyser study, by: 


(i) Disconnecting the generating plant and loads from the 


network. 


(ii) Injecting unit current between each generator and a 


reference point in the system. 


(iii) Measuring voltage differences between all generating and 


load points and the reference point. 


This is repeated for measurement of load self-impedances by 


injecting unit current into the network at L, and Lp. 


The impedances of the network from a generator point of 


entry are called generator self-impedances, and transfer or 
mutual impedances to other points. Impedances measured from 
load points are termed load self- and mutual impedances. The 
actual self-impedances of the generators and loads do not enter 
explicitly into the analysis. 


ig,=]p.u. 


‘REFERENCE 
POINT 


Fig. 2.—Reference frame 1: basic measurements. 


Thus in Fig. 2, with unit current injected at G, 


the generator self impedance, Z;,; = V; 
and the mutual impedances, Z,, = V, 


431= Vir ce ae 
24.1 = Vip 


The impedance matrix for the system is therefore 
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Since the network mutual impedances form a symmetrical 
system, 


Zo.1r= (Zre)t 
where (Z L.G); 18 the transpose of matrix (Zz g). 

That is, the mutual impedances with load points energized are 
equal to those with respect to the generator sources. 
The equations for the system now take the form 

Vy —Vre Zyl + 2, oP + Zy3F + Z, I? 
Va— Vr= 2240 + 207 +Z, 314 + Zyl 
— Vr= 2341" + Z3.P + Z3 31! + Z3 412 
Vig — Va = Z441' + Z, oP + Z, 5I| + Z, I? 


If these system equations are solved by inverting the impedance 
matrix, the equivalent circuit will then be as shown in Fig. 3. 


(3) 


p qui 
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Fig. 3.—Equivalent circuit for measurement reference frame 1. 


(2.2.2) Reference Frame 2. 


Let the total system load current, during a normal load period, 
- be given by 


jE = JH + Jl2 (= hypothetical load) . . . (4) 
Then, from assumption (a), 
fH ji 
ya: = L,, Wis = lL, . . . . . (5) 
where /,, /, are constant complex ratios. 
Thus JEL == J, JE 
12 L (6) 


or [En = ChJE where Ch» = eee as) 
2 | by | 


The system with individual load currents may therefore be 
transformed into one which contains only the hypothetical total 
current J© by the transformation matrix 


Saeed 


(8) 


(Note that the generator currents remain constant.) 
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and the currents and voltages in reference frame 2 are given by 
Ti Ce A ee eee ( 

Vz = (CY, « (10) 


where (C})¥ is the conjugate of the matrix (C4) transposed. 
The system impedances in frame 2 are given by 


Zp. = (CHFZ Ch 


(11) 


The system performance is now represented by the equation 


V3 — Zaake 
i.e. V; = Vr = Zi. + Z1.217 + a,Te 
Vy — Vp = Z5,40' + Zy.9I? + alt (12) 
Vy, a Vr = bT + bI? + wl 


where in this example V,, the hypothetical load voltage, is the 
weighted average of the 2-load voltages; 


ne: Vr = it Vi ae LF Vi2 (13) 
4 2 4 Pe 
and =D D2 Kha, = DY D Zo.Kbn 
K=3n=1 K=3 n=1 
(14) 


4a A 2 

b= SS » Zxslnb2 = a >> Zraln 
K=3n=1 K=3 n=1 

w= > > Dd Z as given above 


The system may now be represented as in Fig. 4. 


Z22 


HYPOTHETICAL 


r=-1? 
REFERENCE 
GENERATOR 


Fig. 4.—Equivalent circuit for reference frame 2 with one 
hypothetical load. 


The system losses are now given by I?*Z ,1* = I?*V,._ How- 
ever, this method of determining losses is not desirable since it 
involves the difference between large quantities, namely between 
generator and load powers. 
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(2.2.3) Reference Frame 3. 


In this reference frame, the hypothetical load current is 
eliminated by introducing a transformation tensor C from a 
known equation of constraint. Now 


P+P+Pr+F=0 
=a —-fP—-P—-Pe—2XT). 


(15) 


Since the generator currents J! and J* remain unchanged in 
this reference frame, the required transformation matrix to 
replace the equation of constraint is 


Therefore 


(16) 


The currents and voltages in this reference frame are now 
given by 


[ts 713 (17) 
V3 = (CPV, (18) 

and the system impedances by 
Z33 = (C))PZ22C3 (19) 


The impedance matrix Z33 is asymmetrical and complex. 
The performance equations obtained from V3; = Z33/° are 
ha Vp= Gin bi ea + WE 
(Ai ot 6 = ia We ae 
Vo — Vp = (Za — 81 — a + wl 
+ (Zh... — b2 — a + wl? + (w— a) 
V; Vip =(w — bd)! + Ww — 6)? + wi3 


5 (20) 


The voltages V3; represent the potential differences between 
generators and hypothetical load with currents J? entering and 
leaving at these points. 

The components of the system equations are represented in 
Fig. 5. It is evident from this Figure that the load voltages and 
currents do not now exist in this reference frame 3. 


Z33 


HYPOTHETICAL 
LOAD 


Fig. 5.—Equivalent circuit for reference frame 3. 


The equation J°*. V3 represents the various losses in the 
system with reference to the generator currents and the voltage 
drops between the generators and hypothetical load. It does 
not involve the difference of large quantities. 


(2.2.4) Reference Frame 4—Change of Axes. 
For the hypothetical system under consideration, the generator 


and load terminal voltages and currents for a normal load period 
are illustrated in Fig. 6(a) and 6(5). 
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La HYPOTHETICAL 
LOAD 


(>) 


Fig. 6.—Vector diagrams for the hypothetical system at normal load. 


(a) Normal load generator voltages and currents. 
(b) Load voltages and currents at normal load. 


The generator terminal voltages V;, V2 and V3 are referred to 
a direct axis along the direction of the total load current, by 
angles 6,, 6, and @3, respectively. The generator currents lag 
behind the terminal voltages by the respective angles ¢,, ¢, and 
$3. The load voltage and current conditions at normal load 
are illustrated in Fig. 6(6). 

A set of axes d’ and q’ is now introduced, along and at right 
angles to the generator terminal voltages. The generator currents 
projected upon the respective normal load terminal voltages will 
then represent the generator powers. 

In this new reference frame, the active and reactive power 
components of each generator current J, will be given by, 

I’ = J? + ja’ (21) 

The effect of rotating the axes d and q to new axes d’ and q’ is 
obtained by rotating the current vectors in the opposite direction, 
through an angle —0, so that 

b= ef. (22) 

Thus, for all generator currents, the transformation matrix 

required for quantities in reference frame 4 is 


1 2 3 


(23) 


The new impedance matrix is now given by 
Z44 = (C)FZ33C} 
and the system equations take the form 


Ky, = Zale 


(24) 
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3 
or [Vn] — Viel = SY Z,_ ei On— On] Ke—i0x) (25) 
K=1 


(n = 1, 2, 3) 
where Z,,_ x is the general term of 2333 


or 
3 
| V,,| i V pein i a ee COs G,, ae Ox) oars AG eK sin 6, - 9x)] 


+ H[Xn—x 608 (8, — Ox) +ER, x Sin (On — O,)]\UKex) 
(26) 


where (Rz_x +jXn_x) represents the components of Z,_ K: 
The voltage vector V, represents the voltage drop from generators 
to hypothetical load. 

Any complex impedance, say Z = R + jX, may be replaced by 
a matrix containing only real numbers of the form 


d q 
alii 18 —X 
Z= (27) 
GuimaeXa R 


This introduces direct and quadrature axes d and q, in which 
‘ne voltage and current vectors are represented by in-phase and 
quadrature components. 

Thus if J = [4 + j/7 and V = Vz + jV,, then, in matrix form, 


Ta pireval 1 Se 


diel? Gueve 


(28) 


Thus the complex equation V, = Z,,I4 may be expressed in 
terms of real quantities by replacing the respective matrices by 
others containing twice as many equations and variables, but 
containing all real components. 

For the 3-generator system, with the impedance Z4,4 expressed 
in real numbers, the general equations for system performance 
will take the form 


V = > [Rn—x 00S (On — 9x) —Xn_x sin (0, — Ox )f* 
eee 

— SS [Xx 008 (Oy — Ox) + Raa sin Oy — Ox) 
ne » . (29) 
Von = Oe [X,—x 608 (On — 9x) + Rn—x sin (0, — Ox) I* 
(= 1, 2, 3) 


a= > [Rn—K COS (6, —9x) —Xn_—K Sin ey 
Kea 


(2.2.5) Reference Frame 5. 

In this frame the reactive components of the generator currents 
are expressed in terms of the active components in phase with the 
normal load terminal voltages. 


Using assumption (d) of Section 2.1, the components of the 
various generator currents may be expressed in the form 


J4 = [a 
| EW Gs | on 


where the constant A is given by the ratio of the normal load 
measurements, /9’/I¢’ = tan ¢, (in Fig. 6a) = Q/P, Q and P 
being the reactive and active power components respectively, of 
each generator at normal load. 

Such a set of equations for the components of all generator 
currents in the 3-generator system will produce the transforma- 
tion matrix 


dine was. “ds 


i.e. It = Cis 


(31) 


The new impedance matrix, containing only real numbers, is 
now given by (C4),Z44C4, and the system equations are given 
by the general term 


Ce a DG) ei 
(n = 1, 2, 3) 


3 
oO AER cos (4, — Ox) — X,_x sin (0, — Ox) Cd +A,Axg) 


+ [Xx 008 (On — Ox) + Rn—x sin Gn — Ox)\(An — Ay} 
x U4® + Axle) (32) 


(2.2.6) Reference Frame 6—Transformation of Generator Current into 
Power. 
By assuming that the generator terminal voltages remain 
practically constant in the region of the normal load values, the 
active power of each generator will be given by 


Py = FV |Vilgp A = 1 |Vilo - Pi 
P> — jé2’ | Volo or jar — 1/| Valo Py . (33) 
Pye ail oe Vg ga ee ae I/|V3|o P3 


Thus the required transformation matrix for quantities in this 
reference frame is 


(34) 


The final impedance matrix is given by 


Ze6 = (C2, Z5sC2 (35) 
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and the system equations are given by 


Kat I's iC 3 1 
— an oo zak! a as R 9 4 
|Valo K=1 TARLARL n_K COS (Gy ey 


a An—K sin (6, = Ox)\C Gia A,Ax) 
+ [X,_x« cos (0, — Ox) + Rn_x sin (6, — Ox) (An — Ax)}Px 


3 
or = > MxPx. (36) 
K=1 


where Pr = (AK + Agl®®)| Velo 


is the real power supplied by generator K and M,x represents a 
set of real constants. 


HYPOTHETICAL 
LOAD 


Fig. 7.—Equivalent circuit for the final reference frame 6 with active 
generator powers. 


Fig. 7 illustrates the equivalent circuit with real impressed 
generator powers. 
The final loss equation is now of the form 


Va= Zeck (37) 


The dimension of /° is that of power, and it represents the 
active power P supplied by each generator. 

The term V5 = (C2),V; is dimensionless owing to the 
inverse voltage form of C2. Its components are fractions, say 
L, and represent the incremental J?R transmission losses of the 
system, d loss/dP. 

Zo tepresents the final ‘loss’ matrix containing real components 
Myx of inverse power form Z/V 2. 

Thus, in terms of generator powers and incremental J2R losses, 
the system equations take the form 


Ln a >} MixPr (38) 
K 


which is equal to the incremental transmission loss at generator n, 
d losses/dP,,- 


The total J?R losses in the system are thus given by 
Prosses a > p2 P,rMixPxK : (39) 
n 
(2.3) Simplification of the Analysis for Total and Incremental 
Loss Studies 


For incremental loss calculations only the real parts of the 
differences of potential existing in the basic measurement 
reference frame 1 need to be measured. It is also sufficient to 
use only the real components of the matrix C, for transforma- 
tion from reference frame 1 to 2. This latter simplification 
will produce a symmetrical impedance matrix Z,, containing 
only real components, and the former a and b components 
change to d = (a + b)/2. 

The total J*R losses in the system may be found by using only 
the symmetrical part of Z¢ containing coefficients Ree, say B. 
In this case, the total transmission losses are given by 


Prosses = py x PrBixPK and Bux = Br, (40) 
n 


B 
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Thus, for the 3-generator system, 


Piosses = By Pi a ByP3 23 B33P3 
+ 2ByP,P, + 2B,3P1P3 + 2B23P2P3 (41) 
The same result is also obtained if only the symmetrical part 
of R33 and the skew-symmetric part of X33 are used when trans- 
forming Z33 to Ze, assuming Z33 has the form R33 + JX33- 
Using the above simplifications, the general loss coefficient 
takes the form 


[cos (6, a Ox) ae A,Ax) 
a sin (6, ae) Ox) An Ta Ax)] 


= n—K 
oot le 
[Vrlol Vclo He 


(3) STUDY OF THE LOADING CONDITIONS ON A SECTION 


OF THE BRITISH NETWORK 
The Grid system for which a transmission-loss formula is to 


be determined consists of the 132kV Warrington section of the 
North West, Merseyside and North Wales Divisional Network. 


This section comprises steam generating stations at Warrington, 


Percival Lane and Ince, and 33kV Area Board load points at 
Warrington, Percival Lane, Ince, Crewe and Knutsford. Fig. 8 


WARRINGTON CARRINGTON 


(85 MW) 


8-2 MILES 
(0-012 +40°032) 


10-0 MILES 
(0-014 + j0-038) 


11-9 MILES 


PERCIVAL LANE 


(0-017 + j0'046) 


KNUTSFORD 


27:3 MILES 
(0:039 +jO'105) 


> oe 
oO 
8 o| 9 
° a} 2 
ee = ae 
+ 
= <| 8 
INCE ° w}| 2 CREWE 
(e) mn 9 ) 
(386 MW) LL = (29 MW 


26:6 MILES 

(0-038 + j0-102) 
L3 L4 

Fig. 8.—System diagram—Warrington Section. 


Base apparent power—100 MVA. 
Base voltage—132kV. 


illustrates a simplified diagram of this system with the main 
generating, load and interconnection points. The available 
active-power generation exported at each busbar is given in this 
diagram, which also shows the contribution from Marchwiel 
and part of Maentwrog at the Crewe station. The diagram 
also illustrates the route length of each overhead line forming the 
network, and the per-unit impedances for a 100 MVA base at 
132kV. 


(3.1) Load Flow Study 


The study was carried out analytically in the absence of net- 
work-analyser facilities. The first part consisted in determining 
the approximate system load flows from the known generator 
loadings and the active-power demands at the load points. It 
was assumed that there was a normal 12°% outage of the available 
generation at all stations, and the load demands used were those 
that existed in January, 1955, during the normal peak periods 
of 0700-1300 hours (see Fig. 10). 

The power equations for a short line were then used for 
calculating the various terminal voltages and angles in the given 
system. Reactive line flows were also determined from these 
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74:66-j44:28 


13-97~j5-:05 13-68 j5-35 Gs 36-4|—jl2-88 
68-0 —j42:15 rel 
N 
oe o 
< 8 
\ = 
Gy ao 
3 & 8 
2) 8 
| 
” 
© a 3 
° . = 
Aa 1 
i n 
: 0 
a 9 
KR F Ao 
oll ol- i212 
: 97-25—j56'59 ; 
43-0 - j 39:07 20-23- 7-88 
wv 
© fe) 
S m 
= ba 
E o 
~ = g 
N v ° 
& 2 
! 
e e 3 
A a 
3 3 
bd m 
(eo) = 
2 Q 
S3 Iho -ol=1°36' 
347-0 - j215:0 19:54~j5-42 19-26—-j6:06 74-0 — j35:85 
340-S1-j 204-6 25:5—j18-42 


Fig. 9.—Load-flow data for a weekday peak period, including line 
flows, generator loadings, load demands and bus voltages. 


WIRRAL AND N WALES LOAD DEMAND 


AT INCE -P, 
Panis L3 


LOAD DEMAND AT 


CREWE -Py4 


°o 02 04 06 os ite} 12 14 16 18 20 22 24 
TIME , HOURS 


Fig. 10—System load demands—17th January, 1955. 


(a) Total area load, Pz. 
(6) Wirral and North Wales load demand at Ince, P73. 
(c) Load demand at Crewe, P74. 


equations, and balance of busbar conditions in the system loop 
was obtained by a series of successive approximations. 

The results of the final approximation to system load flows to 
give the desired system balance and realistic generator and load 
reactive demands are illustrated in the load-flow diagram of 


Big. 9, 


‘4) STUDY OF THE APPLICATION OF KRON’S METHODS 
TO A SECTION OF THE BRITISH NETWORK 


(4.1) Reference Frame 1 


For the measurement of the basic impedance matrix of the 
Warrington section network, illustrated in Fig. 8, the Carrington 
“ie-line connection Gs; was chosen as reference point and was 


earthed. 


With unit current impressed at the generator G, the various 
leakage impedances are given by 
Ven 
Lie em i og a ea ae AAD) 
IGK 
and in this case reference generator current [gz = impressed 
current Ig,. 
Thus for generator points of entry, 


+ 


V, 
Z1.1 = —4 =0-0105 + j0-0280 
1G] 
V, 
Z = —22'= 0-0087 + 70-0231 
1G1 
V, 
Z3,, = —2 = 0-0073 + j0-0195 
1G1 
Vo4 * 
Th ee () ‘ 
ote ae 0-0025 +j0:0067 | ayy 


and for load points of entry, 
Z5.1 = Z,,, =0-0105 + j0-0280 
26.1 = 22,1 = 0°0087 + j0-023 1 
27,1 = 23,4 = 0:0073 + j0-0195 
23.1 = 24.1 = 9:0025 + j0-0067 


V; 
Te = = 0:0011 + 70-0029 
Gi 


With unit current impressed at generator G,, 
Z,.2 = 0:0087 + j0-:0231 = Z, 5 
Z),2 = 0:0188 + j0-0507 = Z 
Z3,2 = 0:0160 + j0:0428 = Z,, se (45) 
Z4.2 = 0:0055 + j0:0146 = Z, , 
Zo,2 = 0:0023 + j0-0063 
With unit current impressed at generator G;, 
Z;,3 = 0:0073 + j0-0195 = Z,,, 
Z>,3 = 0:0160 + j0:0428 = Z, 
23,3 = 0:0226 + j0:0600 = Z,.; se AAG) 
Z4,3 = 0:0078 + j0-0206 = Z, 3 
Zp .3 = 0:0033 + j0-0089 


With unit current impressed at generator Gy, 
Z1,4 = 0:0025 + j0-0067 = Z5.4 
27.4 — 0:0055 + j0:0146 = Z6.4 
Z3.4 = 0:0078 + 70:02006=Z,,7  - =» G7) 
24,4 = 0°0158 + j0-0419 = Z 4 
Zo.4 = 0:0067 + j0-0182 
With unit current impressed at the load L, the leakage impe- 
dances are equal to those obtained with unit current impressed 
at the generator G,. Similarly, with impressed currents at L,, 
L; and L, the leakage impedances are equal to those obtained 


with impressed currents at G,, G; and Gy, respectively. 
With unit current impressed at the load point Ls, 


Z,.9 = 0-001 1 + j0-0029 = Zs.9 1 
Zy,9 = 0:0023 + j0-0063 = Z¢.5 


Z3.9 = 0-003 3.-++ j0:0089 = Z7.9 sad ee 
Z4.9 = 0:0067 + j0:0182 = Z5.4 
Zo,9 = 0:0125 + j0-0339 J 
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The above impedances now form the components of the basic 
symmetrical leakage-impedance matrix Z;. 


(4.2) Reference Frame 2 


In this analysis the hypothetical load for a normal load period 
will be given by 


a jE aie jp2 ak [3 ae yp4 ak p55 ; (49) 
and the components of the C; matrix by 
jE! p22 [3 ji4 ps5 
Sg a beist: lee a l4= TE» ame ai Py (50) 


However, since available information for the given load points 
consists of the half-hourly active-power loading only, it is not 
possible to obtain the constant complex ratios, /;, /5, etc., in 
the above form. An assumption is thus made that the ratio of 
the active-power loading at the load points to the total active- 
power load remains constant, and approximates to the corre- 
sponding current ratios, as given above. Such an approximation 
assumes that all load voltages and power factors are equal, and 
that all load currents are displaced equally from the total load 
current. These assumptions are valid for the system under 
consideration owing to the small phase displacements between 
the terminal voltages, the previously assumed constant load 
power factors and the approximaie nominal system voltages at 
the load points. 

A similar assumption is made by George,” in which the average 
line voltage and average power factor for the heavily loaded 
portions of the systems are used in the analysis. 

Fig. 10 illustrates the half-hourly integrated loadings on 
17th January, 1955, for two of the five load points, and the total 
area load for the Warrington section network. To compare the 
general trend of each load to that of the total system load, the 
ratio of each half-hourly load to that of the total was calculated. 

It was apparent from these ratios that the trend of the total 
load was reflected at all load points; that is, the pattern of demand 
was the same at all load points, as stated in assumption (a). 

The loading ratios to be used in this analysis have been obtained 
by averaging the individual half hourly ratios, and are 


J, = 0-110, 1, = 0-108, 1; = 0-633, 1, = 0-111, /5 = 0-038 


The absence of reactive metering data at the load points 
prevented the use of complex ratios in this analysis. 
leakage-impedance matrix for reference frame 2 is now given by 


222 = (Ci)FZ4(C}) 


Z4.2— 4d, —d,+w 
= + 0:00242 


22,1 — a, —d; +w 
= + 0-002 42 


2.9 = 2d> + Ww 
= + 0-005 74 


27,3 ae d, 
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In this analysis (C})* = (C}) and only the real components of 
Z,, are considered. 
The components of Z>, are given and calculated as follows: 


Z1,, = 0:0105, Z;,» = 0-0087, Z;,3 = 0-0073, 


Zip= 010025 
Zp, , = 0-0087, Zp.» = 0-018 8, Zy,3 = 0-0160, 
22.4= 0:0055 
23.4 = 0-007 3} 23.2 SS 0-0160, 23.3 ==; 0-0226, 
Z3.4 = 0:0078 
Zs, = 00025, Zy.> = 0-005 5, Zy.5 = 00078, 
Z4.4= 0:0158 
dy = Z,.5h, + Zy.6l2 + Z1.7/3 + Z1.8l4 + Z1.9/5 
= 0:00703 
dy = Zy,5l, + Zo,6l. + Zo.713 + Zr.8l4 + Zr.9)s | 
~ 0-013 81 | 
d3 = Z3,5l, + Z3.6l2 + 23.713 + Z3.8l4 + 23.915 
= 0-01783 
dy, = Z4,sl; + Z4.6l2 + 24.713 + Za.sla + Za.0l5 (52) ‘ 
SS (HOO If - Ey 
and w is given by, | 
Zs.slilf + Zs.6hlf + Zs 7/5lf 
+ Zs. sl4lt + Z5,ol5/i — 0-000 774 


Zo sh lt + Zo.6hlF + Ze.q3lF 
+ Z6,gl4l3 + Z6.9l5l% == 0-001 492 
+ Z7,6lolf + Z7 71/3 
+ Z7,glalF + 27 o15lF 
Ze.sh lf + Ze.6lolf + Ze.q13]5 
se Ze. 8l4l4 + Z3.o15]4 = 0-000 867 
Zo, 511% + Zo6lol§ + Zo,7/3l% 
+ Zo, gl4lé + Zo,9151% = 0-000 140 


Z7, 5113 
= 0-011 285 


0:014558 = w 


(4.3) Reference Frame 3 


In frame 3 the hypothetical load current J is replaced by the 
reference generator current [® = J°, using the equation of con- 
straint given by 


23,2 at d; = dy + Ww 23,3 


(51) F=—-fp-pPp—pPp—]f*—Pf, ie. = — DIF . (53) 
The impedance matrix is given by 
The 
233 = (C})¥Z2(C3) 
and its components (R,,__x) are 
Z,,.3 —d, —d,;+w 21.4 —-d,—dy+w w—d 
= — 0:003 = + 0:00222 +0: 00753 
—d,;+w 27,4 —d,—d,+w w—d, 
= — 0-001 08 = — 0:001 56 +0: 00075 
—2d,;+w 23, —d;—d,+w w—d, 
— "+ 0-001 50 — 0-003 28 —0-00327 | > 4) 


= — 0-001 08 


—d,+w|Z4,.—d,—d,+w 


w—d, 
4 0-00753 


w —d, 


24,3 — dg 


a a3 +w 
+ 0-002 22 = — 0-001 56 = — 0:003 28 


Was 
Pact: 0% 00075 = — 0: 00327 


24.4 — 2d4+w 


Ww Se 
= + 0:01474 


+ 0:00675 


w—d, 
= +0: 00675 


w 
+ 0:01456 
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It will be noted that the components of Z;3; are symmetric, 
‘and contain only real quantities similar to the real symmetrical 
(components R3; of Kron’s analysis,> and the Kmn coefficients 
(of Ward, Eaton and Hale [Reference 4 eqn. (6)]. 


(4.4) Calculation of Loss Constants 

The general term of the loss matrix is given by 

Ru_xk 
|Valol Valo 

[cos (6, ie Ox) ae A,Ag) + sin (6, = Ox)(An —s Ax)] 

(55) 

The generator terminal voltage angles, 6, may be referred to 
_any reference axis, and in the present study this has been taken 
along the terminal voltage of generator G,. 
The constant ratio (A= Q/P) for each generator, and the 


generator terminal voltages are those obtained from the normal 
load-flow study, and are illustrated in Table 1. 


(One 


Table 1 
GENERATOR DATA FROM LOAD-FLOow STUDY 


i Generator output Generator 

MGencrator terminal : Per-unit 

i es voltage a! pees bus voltage 
i P Q angle w.r.t. Sat 

: (MW) (MV Ar) G70 


' 
Gy 74: 66 44-28 0 0-593 1 
G2 O7E25 6259 Sei li 0:°5819 
)} G3 340-51 | 204-6 A 0: 6009 
G4 25°5 18-42 —12 367 0:7224 
G5 36°41 12°88 —17’ 0:3537 


From the foregoing general term, 
1 

1f = Tp 

LAr 


= 0-011(1 + 0-593 17) = 0-01487 . 


B R,_,(1 + Ad) 


(56) 


By = pert 2) [cos (0; — 8,)(1 + A,A,) + sin(?; —8,)(A,—A,)] 
IVilol Valo 


= 0-002 42 [cos (—11’)(1 + 0-5931 x 0-5819) 
+ sin (—11’)(0-593 1 — 0-581 9)] = 0-003 255 (57) 
All the B,x coefticients obtained from the foregoing general 
term are given in Table 2. 
Thus, for the 5-generator system, the total system per-unit 
loss is given by 
Pr = 0:0149P? + 0-007 7P3 + 0-002P3 + 0:0224P} 
+ 0:0164P2 + 0:0065P,P, — 0-008 1P,P3 
+ 0:0063P,P, + 0:0182P;P; — 0:0029P,P; 
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For the station loadings obtained from the load-flow study, 
i.e. for 


P, = 74:66 MW, P, = 97:25 MW, P; = 340-51 MW, 
P, = 25-5MW, and P; = 36-41 MW 


the total system loss is, Py = 0:006227 per unit or 0-623 MW. 

These losses do not compare favourably with those obtained 
from the load-flow study, the results of which are illustrated in 
Fig. 9. However, the main purpose of the flow study was for 
the determination of voltages, angles and Q/P ratios. Excessive 
errors in the losses estimated from Fig. 9 are the results of finding 
differences of large quantities. 


(5) CO-ORDINATION OF INCREMENTAL FUEL COSTS 
3 AND TRANSMISSION LOSSES 

In this Section the previously determined transmission losses 
of the system are combined with the incremental fuel costs of the 
generating stations, and a loading schedule is obtained which 
will give minimum operating costs for given total generation. 

In the case of a number of generating stations supplying a 
power system and loaded on an equal incremental fuel cost basis, 
power will be transmitted from low- to high-cost regions due to 
the variation of fuel costs at different stations. For economic 
division of load between the stations it is thus necessary to con- 
sider the resulting transmission losses, and to amend the station 
operating costs accordingly. 


(5.1) Methods of Co-ordination 


The mathematical analysis for co-ordinating incremental fuel 
costs and transmission losses is based on the methods for deter- 
mining the maxima and minima of a function of two variables, 
the latter also being related by an equation of constraint. 

Using the methods of Lagrange’s undetermined multipliers 
(Courant,>* Kirchmayer and Stagg’), the condition for minimum 
fuel input is given by 

dS; 


Pr 


where dS,,/dP,, = incremental fuel cost (£/MWh) of station n. 


(59) 


| 


and 0P,/dP, = incremental transmission loss (MW) for a 
megawatt change in generation (MW/MW) at 
station 7. 


Solution of the non-linear simultaneous equations obtained from 
eqn. (59) for each station, by variation of A, will yield the plant 
schedules for different total system loadings. 

If, in the general case, the fuel cost input curve for station n 
is assumed to be of the form 


— 0:0044P,P, + 0:001 8P,P; — 0:0094P3P, Sy = m,P2 + C,P,(£] br.) (60) 
— 0:0079P3P; + 0:0168P4Ps5 (58) ds 
; : : then ——" == (2m,)P, + Cy, = m,Pn G (61) 
where P;, P>, etc., are the per-unit station loadings. dP, 
Table 2 


COMPONENTS OF FINAL Loss MATRIx 


11 
+0-014 87 
Dy 


1 
+0-003 26 
24 


2) 


—0-002 21 +0-000 91 


33 44 55 
+0-00204 +0-022 43 +0:016 38 
14 5 


i 
+0-009 12 
35 
—0-003 96 


—0-004 69 +0-008 42 
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where m,, = slope of incremental cost curve (£/MWh/MW). 


and _C,, = intercept on incremental cost scale (£/MWh). 


oP, 
Se DS lye 
Also YP, x K 
Thus eqn. (59) becomes 
(MP, + Cy) +A 2BaxPe=A. . - (2) 
K 


For the five generator system under consideration, the co-ordi- 
nation equations are 
m,P, + A(2B,,P + 2B iP 25 2B13P3 4 2B,4P4 } 
= 2Bi5Ps)= A = (Gi 
mP, + A(2By2P, + 2By2P2 + 2Bo3P3 + 2Bo4P4 . (63) 
etc. 
Available information, however, concerning station fuel costs 


will take the form 
Sp= 1, PPeur Cw a eer Ee) 


where m, and C, now relate to the slope and intercept of the 
input/output curves. 


Thus 


In this case it is now possible to charge the system losses at 
the incremental rate of received power A, and to obtain a loading 
schedule from the solution of linear equations. 


(5.1.1) Co-ordination Equations for a Five-Generator System. 


In terms of the actual B,, constants and linear input-output 
curves, the co-ordination equations simplify to 


0-0298P, + 0-006 5P, — 0-008 1P; + 0-006 3P, 
1 m 

OVI pe 5 ee 

+ 0-018 2P; aa. 


0-006 5P; + 0-0154P, — 0-0029P; — 0-0044P, 
1 m 
02001 3Pa = (fe 
+ 0-001 8P; ae ' 

— 0-008 1P, — 0-0029P, + 0-004P; — 0-0094P, 


1 m (65) 
0-007 9P eng \ fe 2 
ous roi! A 
0-006 3P, — 0-0044P, — 0-0094P; + 0-044 8P, 
1 m 
+-0-0168P = a0 Bee 
r Pmni00 } A 
0:0182P, + 0-001 8P; — 0-0079P; + 0-016 8P, 
1 m 
+ 0-0328P = sl ie ils 
5 100 : r 


(P;, Po, etc., are per-unit station loadings.) 


or, in matrix form, BP = (1 Ee ) 


The solution for the station loadings will now be obtained by 
inversion of the 5 x 5B matrix. This is calculated using the 
methods of Kron,”°, p. 258, for the inverse of a two-row com- 
pound matrix. 


Such an inversion yields the matrix equation 
m 

poe B11 = -) 
A 


This gives the respective per-unit loadings for minimum total 
fuel cost as follows: 


+ 


P, = 46-922 — (5-699, + 4-730m, + 29:965m, 
+ 5-600, + 0-928ms) 


Py= 45-635 — (4° 730m + 5:486m, + 28-758, 


+ 5°452m, + 1-209ms) 
1 


P3 = 273-446 — 
; A 


(29-965, + 28-758m, + 174:902m; 
+ 32-583m, + 7:238ms5 
; [ . 6) 
Pj = 5i-1si— gO UO + 5:452m, + 32:583m3 
+ 6:362m, + 1-184ms) 
P, = 11-420 — 510928 + 1:209m, + 7:°238m3 
+ 1-184m, + 0-861) 


UP = 428-604 — (46:922m, + 45-635, 


+ 273°446m, + 51-181m, + 11-420ms) | 


Loading Schedule. 


From the weekly return of fuel costs at steam stations con- 
tained in Form CEA/G.S.-15 of the Central Electricity Authority, 


the following incremental costs were estimated, and they are used — 


for illustrating the application of the above loading equations. ~ 
These are assumed figures only, based on average fuel costs and - 


existing orders of merit. 


mM, My M3 M4 Ms 
0-5 0:6 0-4 0:8 0-35 pence/kWh 
2°08 2:50 1:67 3:33 1°46 £/MWh 


Under these conditions the loading equations reduce to the 
following form: 


P= 4692. 
A 
Ppt AS 6350 
r 
Plo se 
(67) 
P, = 51-181 — 102:606 
A 
Puasa lsd) ees 
A 
XP = 428-604 — “= 


The respective station loadings are now obtained by deter- 
mining the value of A for a summated generation (XP), and 
applying this same value of A to each remaining equation. 

For per-unit values of © P in excess of 4-316, corresponding to 
a value of A equal to 2-016 2£/MWh, it becomes necessary to 


base-load the available generation at G4, say equal to the maxi- 
mum per-unit value of 0:29, 
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STATION LOADINGS , Py ». MW 


° 
250 350 450 SSO 
TOTAL SYSTEM GENERATION, =P,MW 


Fig. 11.—Generator loading schedule for optimum system economy. 
Gy, base-loaded at © P = 4-316. 


(5.1.2) Co-ordination Equations with Generating Station G4 (Crewe) 
Base-Loaded at its Maximum Available Generation of 
29 MW. 
With G, base-loaded at 29 MW, the co-ordination equations 
(65) can be reduced. Inversion of the B matrix will again give 
the solutions for generator loadings in the form 
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Loading Schedule. 


Using the previously assumed incremental fuel costs, and for 
values of XP in excess of 4-316, the loading equations now take 
the form 


Pi = VusyvS — 

Py = 2-0178 — — 

Py = 12-7805 — == . (68) 

Pe e1:940 Fee 
299302 


UP (including Py = 0:29) = 19-1557 — 


A complete loading schedule obtained from these equations 
for system loads, ranging between 250 and 550 MW,, is illustrated 
in Fig. 11. 

The accuracy of the generator loading equations has been 
studied by checking the inversion of the B matrix, eqn. (65), from 
the product BB~!. This was found to be of approximate unit 
matrix form, thus indicating that the inversion of B is correct. 

An investigation has also been carried out to check the scaling 
of this matrix. From the study, which consisted of checking 
each step in the inversion of B, it is apparent that the expression 
(B44Bs; — B4;*) which appears throughout the inversion is 
having a powerful effect on the expressions for generator loadings, 
and in particular on the term 174-902, in eqn. (66). 


R 
By = 4 + Ad 


Now 
[Yal6 
(69) 
and Bss = = 3(1 + A3) 
[M53 
Also, from the 2733 matrix, eqn. (54), 
Ry4 = 24.4 — 2d, + w 
4—4 4.4 4 \ (70) 
and Rs_5 =w 3 
3-0 50 1-6 
9 
25 + 
40 
20 
20 
30 2:1 
Ss 
2:2 
20 
He 2:3 


5 
Fig. 12.—Nomogram for total generation XP. 
1 


m 
P=B(c—=) 

ES) 30 | 5O 30 
2:5 2:5 2:5 

ae 40 

as 
_ a Sa 
— 7 
Pie) = 
20 20, fac 
~ 
~s 
~ 
29 
~—™. 
PSS 

1-5 1's 1-5 

20 
ee) ie) 1-0 

m q m2 PS m3 
(REF. LINE) 
Examples. 


Left-hand Nomogram. 


my, = 2:08 £/MWh to mz = 2:50£/MWh gives q. 


q to m3 = 1-67£/MWh gives r. 
rto ms = 1:46£/MWh gives s. 


Right-hand Nomogram. 


s = 29:8£/MWh to required total generation of 500 MW gives 4. 
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Fig. 13.—Nomogram for generator G, of loading P1. 


s = 3:36£/MWh to A = 2°113 eIMWh (from Fig. 12) gives Py = 52MW. 
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m4 rsm2 M3 
Example. 
As in Fig, 12 to give new value 0 
Now Ligea = 2d4 (71) 


Therefore the value of w appears to be influencing the com- 
ponents of B~!, 

From a study of the components of w (=0-014558), it is 
found that the term Z7,7/3/{ = Z3_3/;/{ (=0-009056) is exces- 
sive in relation to all other quantities. 

It can thus be concluded that the generator self-impedance 23,3 
(with unit current injected at generator G;), the loading ratio /, 
(i.e. load at Ince as a proportion of total load) and the ratio 
4 of reactive to active power for generator G, (Crewe), at normal 
load, are having powerful effects on individual generator loadings. 

This investigation illustrates the fact that, in practice, con- 
siderable accuracy in computation will be required when the 
magnitudes of the system parameters and load ratios have wide 
differences. 


(5.2) Loading Schedule Nomograms 


The form of the final co-ordination equations enables a set of 
nomograms to be constructed to represent the various generator 
loadings for optimum operating efficiency. 

The methods given in Reference 34 have been used for the 
five loading equations, including the one for total generation, 
and give rise to five nomograms, two of which are illustrated in 
Figs. 12 and 13. 

Each diagram represents the respective loading equation and 
each variable is represented by a graduated line. Index lines 
drawn across the diagrams according to the values of the 
variables, in this case the station costs, will give a direct solution 
of the equations they represent. The various generator loadings 
are obtained by the use of the respective nomograms, using the 
known m values and the values of A found from the total genera- 
tion nomogram of Fig. 12. 

The example given on the nomograms illustrates a loading 
schedule for a total system generation of 500 MW, and the various 
generator loadings compare with those obtained by calculation 
from the loading equations. 


(6) CONCLUSION 


In this present analysis, fuel costs and losses have been 
co-ordinated using the exact equations in conjunction with linear 


input-output characteristics for each station. With this assump- 
tion, the solutions obtained from one matrix inversion are valid 
for all variations in fuel costs, and the linear loading equations 
may be adapted for nomographic representation as illustrated in 
the paper. These charts will provide a method of developing 
rapidly the economic loading schedules as functions of total 
generation. 

If the incremental fuel costs are assumed as functions of the 
station loadings, then each variation in station costs will neces- 
sitate a matrix inversion. In this case the rapid calculation of 
incremental transmission losses combined with generating costs, 
and the immediate application of these results to the system, will 
require the use of simplified network analysers or digital com- 
puters for the solution of the linear equations. 

Comparing the economic loading schedule of Fig. 11 with the 
loadings obtained from the load flow study, it is apparent that 
for greater operating economy the generation at Percival Lane, 
G,, and to a lesser extent that at Warrington, G,, must be 
reduced and transferred to the more economic sources at Ince, 
G;, and the Carrington interconnection, G;—provided that 
adequate tie-line capacity is available. 

A schedule for minimum loss can be obtained from the fore- 
going analysis, and if compared with that obtained by considering 
the effects of fuel costs, it becomes apparent that the value of the 
low-cost import at G; is much reduced due to the greater effects 
of line losses. Station loadings at G, and G, under these con- 
ditions are greater than the corresponding loadings obtained 
when considering fuel costs. Plant availability, provision for 
security of supplies and relative generation costs usually prevent 
the loading of stations on a minimum-transmission-loss basis. 
Such a condition, however, is of value in the planning of trans- 
mission systems and in comparing delivered costs from stations 
having equal incremental generation costs. 

This analysis has been restricted to the determination of loading 
schedules for a small section of the North West, Merseyside and 
North Wales divisional network. The effects of incremental 
transmission losses on the selective loading of generating plant 
will become more apparent when the analysis is applied under 
light load conditions on this particular network, such as those 
obtaining at night and during the summer months, and when 


applied under more general conditions to larger sections of the 
Grid system. 


NICHOLSON AND LYNN: THE ECONOMIC LOADING OF TRANSMISSION SYSTEMS 


(7) ACKNOWLEDGMENTS 


The authors wish to express their thanks to Professor J. M. 
Meek, D.Eng., of Liverpool University, for his support and for 
the interest he has shown in the work. They are grateful to the 
Divisional Controller of the North West, Merseyside and North 
Wales Division of the Central Electricity Authority, for providing 
and giving permission to use technical data and for helpful 
discussions held with members of his staff. 

One of the authors (H. N.) wishes to thank Lever Brothers, 
Port Sunlight, Ltd., for granting him permission to undertake 
these investigations. 


(8) BIBLIOGRAPHY 


(1) STEINBERG, M. S., and Smrrn, T. H.: ‘Economy Loading of 
Power Plants and Electric Systems’ (John Wiley, New 
York, 1943). 

(2) Georce, E. E.: ‘Intrasystem Transmission Losses’, Trans- 
actions of the American I.F.E., 1943, 62, p. 153. 

(3) GeorcgE, E. E., Pace, H. W., and Warp, J. B.: ‘Co-ordina- 
tion of Fuel Cost and Transmission Loss by Use of the 
Network Analyser to Determine Plant Loading Schedules’, 
ibid., 1949, 68, Part II, p. 1152. 

(4) Warp, J. B., EATon, J. R., and Hatz, H. W.: ‘Total and 
Incremental Losses in Power Transmission Networks’, 
ibid., 1950, 69, Part I, p. 626. 

(5) Kron, G.: ‘Tensorial Analysis of Integrated Transmission 
Systems, Part I—The Six Basic Reference Frames’, ibid., 
1951, 70, Part II, p. 1239. 

(6) KircHMAYER, L. K., and Staca, G. W.: ‘Analysis of Total 
and Incremental Losses in Transmission Systems’, ibid., 
1951, 70, Part II, p. 1197. 

(7) KirCHMAYER, L. K., and MCDANIEL, G. H.: “Transmission 
Losses and Economic Loading of Power Systems’, 
G.E.C. Review, Oct., 1951, p. 39. 

(8) KiRCHMAYER, L. K., and Stacc, G. W.: ‘Evaluation of 
Methods of Co-ordinating Incremental Fuel Costs and 
Incremental Transmission Losses’, Transactions of the 
American I.E.E., 1952, 71, Part Ill, p. 513. 

(9) Kron, G.: ‘Tensorial Analysis of Integrated Transmission 
Systems, Part II—Off-Nominal Turn Ratios’, ibid., 1952, 
71, Part III, p. 505. 

(10) Kron, G.: ‘Tensorial Analysis of Integrated Transmission 
Systems, Part [iI—The Primitive Division’, ibid., 1952, 
71, Part III, p. 814. 

(11) Guin, A. F., KircHMaAyer, L. K., and Stacc, G, W.: 
‘Analysis of Losses in Interconnected Systems’, ibid., 
£952) 71, Part 10, p.1796. 

(12) Gun, A. F., HABERMANN, R., KiRCHMAYER, L. K., and 
Stacc, G. W.: ‘Loss Formulae Made Easy’, ibid., 1953, 
72, Part III, p. 730. 

(13) Kron, G.: ‘Tensorial Analysis of Integrated Transmission 
Systems, Part IV—The Interconnection of Transmission 
Systems’, ibid., 1953, 72, Part IM, p. 827. 

(14) Kircumayer, L. K., Guin, A. F., and Stace, G. W.: 
‘Analysis of Losses in Loop-Interconnected Systems’, 
ibid., 1953, 72, Part UI, p. 944. 


419 


(15) BROowNLEE, W. R.: ‘Co-ordination of Incremental Fuel 
Costs and Incremental Transmission Losses by Functions 
of Voltage Phase Angles’, ibid., 1954, 73, Part III, p. 65. 

(16) Impuraia, C. A., KIRCHMAYER, L. K., and Stace, G. W.: 
‘A Transmission-Loss Penalty Factor Computer’, ibid., 
1954773; Part DMS ps 567. 

(17) Harker, D. C., Jacoss, W. E., FerGuson, R. W., and 
Harper, E. L.: ‘Loss Evaluation, Part I—Losses Asso- 
ciated with Sale Power—In-Phase Method’, ibid., 1954, 
73, Part III, p. 709. 

(18) Harper, E. L., FERGUSON, R. W., JAcoss, W. E., and 
Harker, D. C.: ‘Loss Evaluation, Part 1—Current—and 
Power—Form Loss Formulae’, ibid., 1954, 73, Part IL, 
p. 716. 

(19) TRAVERS, R. H., HARKER, D. C., LONG, R. W., and HARDER, 
E. L.: ‘Loss Evaluation, Part I[I—Economic Dispatch 
Studies of Steam-Electric Generating Systems’, ibid., 1954, 
73, Part III, p. 1091. 

(20) Harper, E. L.: ‘Economic Load Dispatching’, Westing- 
house Engineering, 1954, 6, p. 194. 

(21) GeorGE, E. E., and Prerce, R. E.: ‘Economics of Long- 
Distance Energy Transmission’, Transactions of the 

American I.E.E., 1948, 67, Part II, p. 195. 

(22) GeorGeE, E. E.: ‘Principles of Load Allocation Among 
Generating Units’, ibid., 1953, 72, Part III, p. 49. 

(23) Warp, J. B.: ‘Economy Loading Simplified’, ibid., 1953, 
73, Part Ill, p. 1306. 

(24) Hae, H. W.: ‘Power Losses in Interconnected Transmission 
Networks’, ibid., 1952, 71, Part III, p. 993. 

(25) Kron, G.: ‘Tensor Analysis of Networks’ (John Wiley, 
New York, 1939). 

(26) ZABORSZKY, J., and RITTENHOUSE, J. W.: ‘Electric Power 
Transmission’ (The Ronald Press Company, New York, 
1954), p. 581. 

(27) Cooprr, A. R.: ‘Load Dispatching and the Reasons for it, 
with Special Reference to the British Grid System’, 
Journal ILE.E., 1948, 95, Part II, p. 713. 

(28) OLpDRoyD, G.: ‘Economy Loading of Generating Plant’, 
Electrical Review, 9th December, 1949, p. 1095. 

(29) Don, N.: ‘Economical Loading’, ibid., 19th January, 1951, 
Dal 

(30) Tomps, F. L.: ‘Economic Loading’, ibid., 23rd March, 1951, 
p. 581. ' 

(31) Parsons, L. J., and MARTENS, C.: ‘Computer Matches 
Incremental Rates’, Power, August, 1950, p. 114. 

(32) CouRANT, R.: ‘Differential and Integral Calculus’ (Inter- 
science Publishers, New York, 1936), Vol. I, p. 188-211. 

(33) Lacopo, M. J.: ‘Interconnected System Energy Accounting 
Procedure and Related Operating Practices’, Transactions 
of the American I.E.E., 1953, 72, Part IU, p. 1196. 

(34) Attcocx, H. J., and Jones, J. R.: ‘The Nomogram’ 
(Pitman, 1941). 

(35) SEREBRENNIKOV, V. N.: ‘Determination of Data for the Load 
Dispatcher’s Selection of the Order of Loading Turbo- 
generator Sets’, Elektricheskie Stantsii, 1954, 12, p. 6. 

(36) BILLA, P. M.: ‘Some Impressions of Electric Utility Prac- 
tices in the U.S.A., LE.E. Student’s Quarterly Journal, 
1954, 25, p. 15. : 


621.313.1 


The Institution of Electrical Engineers 
Monograph No. 2958 
Mar. 1958 


TENSOR ANALYSIS OF ELECTRICAL MACHINE HUNTING 
By J. W. LYNN, M.Sc., Associate Member. 


(The paper was first received 6th August, and in revised form 25th BA 1957. Jt was published as an INSTITUTION MONOGRAPH 
‘ in March, 1958. 


SUMMARY 


The paper gives first a brief résumé of previous work on tensor 
analysis of electrical machines. The steady-state equations of the 
synchronous machine are written in Park’s reference axes, and from 
these the hunting equations are derived. It is then shown that these 
equations are part of a general group of transformations of reference 
axes of the synchronous machines, all of which are embraced by the 
general tensor equations. The hunting equations are then derived in 
a freely-rotating reference system. These equations are rewritten in 
tensor form and the significance of the grouping of terms into tensors 
is discussed. The latter equations are shown to give a more realistic 
interpretation of the hunting equations and the equivalent circuit. 


LIST OF PRINCIPAL SYMBOLS 
Indices. 
a, b, c = Quantities in axes fixed to the machine stator 
and rotor windings. 
k, n, m = Quantities in axes all relatively stationary. 
a, B, y = Quantities in axes fixed or free on the stator 
and rotating freely on the rotor. 

s, t = Quantities associated with the mechanical 
rotational effects in the machine (e.g. 
generated voltages and torque). 

u, Vv, W = Quantities in a general equation. 


Electrical parts of the equations. 

Vi», Vx, etc. = Electrical voltage vectors in axes denoted by 
indices. 

, x*, etc. = Electric variables. The electrical charges in 
machine windings, referred to axes denoted 
by indices. 

x* (=i*) = Electric current vector, in axes denoted by 
indices. 

‘Ryo, = Resistance matrix, in axes denoted by indices. 

L. = Inductance matrix, in axes denoted by indices. 

Gy, = Generated voltage coefficients, in axes denoted 
by indices. 


xk 


Mechanical part of the equations. 
J, = Mechanical force. 
x* = Mechanical variable 6, the angular position of 
the machine rotor during rotation. 
X$ (= iS = p0) = Angular velocity of machine rotor. 
R;; = Mechanical friction coefficients. 
L,; (=J) = Moment of inertia of machine rotor. 


General symbols. 

v = Generalized force vector (voltage or mech- 
anical force). 

R = Generalized dissipation matrix (resistance or 
friction). 

L = Generalized inductance matrix (inductance or 
inertia). 

i = Generalized current vector (electric current or 
angular velocity). 


Correspondence on Monographs is invited for consideration with a view to 
publication. _ é 
Mr Lynn is in the Department of Electrical Engineering, University of Liverpool. 


Ck = Connection matrix between quantities in axes 
denoted by indices. 
C = Direct notation for (Ops ie, 
Ci) = Transpose of matrix C. 
O28, = ‘Non-holonomic object’ containing functions 
of C, in axes denoted by indices. 
[~8, y] = A ‘connection’ term containing functions of 
L,« in axes denoted by indices. 
Tug, = A ‘connection’ term containing both [«f, y] 
and (2.2, in axes denoted by indices. 
Sey = Tensor giving the terms Gq. 


(1) INTRODUCTION 


With rapid expansion in the field of control systems a fuller 
understanding of the dynamical behaviour of rotating electrical 
machines has become of increasing importance. For this reason, 
and also because of general developments in electrotechnics, the 
teaching of electrical machine theory from a _ generalized 
dynamical viewpoint is now being considered in universities 
and colleges. Brown, Kusko and White! give details of a 
laboratory machine for teaching purposes, the windings of 
which can be interconnected in a variety of ways to give the 
characteristics of a range of d.c. and a.c. machines. 

One of the pioneers of this approach to electrical machine 
analysis was Gabriel Kron.?:3>4 The matrix and tensor 
methods which he has developed since 1934 have led to a better 
understanding of the fundamental concepts underlying all 
machine systems. A survey of these methods is given in 
Reference 5. The above References show that the analysis of 
most types of electrical machines can be expressed by a single 
set of dynamical equations. 

Recently the behaviour of oscillating-machine systems has 
been receiving a great deal of attention.®°.7>8 Transient and 
hunting conditions have been the subject of investigation 
throughout the history of machine analysis. The development 
of hunting analysis of synchronous machines can be indicated 
briefly by selection of one or two representative publications, as 
follows. 

In 1929 Wennerberg? extended the early work of Kapp and 
Rosenberg. Starting with the design details of a 3-phase salient- 
pole machine he resolved the armature magnetomotive-force 
and flux waves into two axes in quadrature on the armature 
(stator). During hunting the field rotates and oscillates with 
respect to these axes. Wennerberg then derived equations for 
voltages, currents and torque during small oscillations of the 
rotor. The expressions are complicated because of the fixed 
armature axes chosen. All steady-state currents and voltages 
are functions of sin wt and cos wt (where w is the synchronous 
angular velocity of the rotor), and the hunting equations are, of 
course, obtained by making small changes in steady-state values. 
In his expressions for hunting torque the trigonometrical terms 


of synchronous frequency ultimately disappear and the torque is 
expressed as 


AT = A’X sin (hw)t + A” (hw)X cos (hw)t 


where hw is the angular velocity of hunting. Inspection of this 
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eo | synchronizing and damping torque coeffi- The physical concepts RS in the new reference frame have 
eae ae Ric Lafase Cece enn been examined in detau. In order to generalize his work on 
Boxy 5 on,” using implicitly the same reference machines, Kron uses the methods of tensor calculus. Heffron, 
yst as Wennerberg, derived a comprehensive set of equations Rosenberry and Rothe® 7 have given an alternative, more con- 
giving the damping and synchronizing torque coefficients of a ventional, analysis in which they compare hunting equations of 
salient-pole alternator. They examined, in particular, the effects an interconnected system in the reference frames of both Park 
of armature resistance and armortisseur parameters on hunting. and Kron, and point out the advantages of the uniformly 
Curves are then given showing calculated and experimental rotating axes. 
results for a laboratory machine. In the theory of relativity, Einstein’s quest was for laws of 
About this time the two-reaction theory of the salient-pole nature that would hold irrespective of the reference frame chosen. 
synchronous machine was developed. In 1926 Doherty and Kron’s approach to machine analysis has been from the same 
Nickle,!! following Blondel, resolved the armature resultant viewpoint. He looked for basic concepts which exist in all 
m.m.f. and flux-linkage waves into axes along the field pole and machines regardless of the reference axes and for equations 
in quadrature with it. These axes were considered to rotate expressing these concepts. He formulated these for his primitive 
synchronously along with the field structure. Both the armature machine and used already existing tensor analysis to deal with 
and field quantities are therefore constant along these axes in the the transformation of these equations, to give those of any 
steady state, trigonometrical terms at synchronous frequency required machine with any chosen reference axes. As in 
having been eliminated by transformation of the phase quantities _ relativity, it was found that the fundamental machine concepts 
into these axes. During hunting the reference axes rotate and having physical significance in steady-state, transient or hunting 
oscillate with the field structure. In 1929 Park!? expressed this operation were all tensors.!® Equivalent-circuit meshes were 
theory in terms of transient or ‘operational’ impedances in seen to yield groups of terms which constitute tensors. Con- 
these axes, which could be measured directly by tests on the versely, if equivalent circuits were to be set up, the terms of the 
machine. Park then extended his theory to include hunting equations should be grouped into tensor quantities. Apart from 
conditions. His equations of hunting are physically the same equivalent circuits, this tensor grouping of the equations appears 
as those of earlier investigators, but in terms of machine para- to give a better physical picture of the correlation of different 
meters they are more explicit. They are also more comprehen- forms of energy in any physical system. Prof. Kondo?° has used 
sive in that the operational impedances incorporate the effects of | tensor equations, identical in form with the machine equations, 
the rotor (field) circuit parameters as well as those of armortis- for the analysis of aircraft oscillations in which aerodynamic and 
seurs. Park’s equations have now become generally accepted other forces are considered. 
in synchronous-machine theory. The present paper shows how these tensor terms arise in 
Liwschitz,!? using the latter reference axes, has analysed the hunting analysis of a synchronous machine and how the tensor 
hunting of a synchronous machine as a special case of the grouping of terms is associated with the equivalent hunting 
general problem of a doubly-fed machine. Concordia!* has circuit. The equations are particular cases of the general 
given a very comprehensive set of results of the application of machine equations, and the same analysis is therefore directly 
Park’s analysis to a particular machine. applicable to many other types of d.c. and a.c. machines. 
In 1942 a.c. machines, including the salient-pole synchronous 
machine were described by a general theory of equivalent circuits, (2) TENSOR EQUATIONS OF ELECTRICAL MACHINES 
by Kron,!> for both steady-state and hunting conditions. These i ; 
circuits were subsequently used by Concordia, Kron and (2.1) Matrix Equations 
@rary.'® 17 The voltage equations of the stationary-axis primitive machine 
The above work is confined to analysis of behaviour of a shown in Fig. 1, with axes ds, dr, gr and gs, can be written down 
single machine synchronized with a large system. Kron!> by inspection. Written in matrix form these are as follows: 


ds dr qr qs 


= ea 32) 


—M,p0 


indicated that the analysis and equivalent circuits in Park’s These are seen to include those derived by Park}? in 1926 for 
reference frame become complicated when external circuits are the synchronous machine along direct and quadrature axes, 
connected to the machine terminals. The synchronous-machine — usually written”! as follows 

axes rotate and oscillate with the field structure, and the external 

network must also be analysed along oscillating si hae an Impressed field voltage. 

se hich are identical with those of Park in the steady ; * 

ee oer chien rotate uniformly and do not oscillate.!8 Equiva- Vpq = Rygif? + Lypifé + Mgpi® . . . 
lent hunting circuits were derived along these uniformly rotating 
axes, which could be interconnected to build up complete 
eystems.7:19 These circuits are such that the Same power Va, = — Mypif4 — Ry it — Lapi® + Logit . 6) 
joss 1 h mesh gives the damping torque. Synchronizing . 

Bens ths be fad from the circuit. Var = — Mgif4pO — Lg it pO — Rai?” — LgpiY . (A) 


Generated voltage. 
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DIRECT AXIS 


ds y 
, 
dr ps : 


$7 
7 
4, 
a 


QUADRATURE 
bee 
AXIS 


qr 


S) 
by 


2 


(a) (b) 
Fig. 1.—The primitive machine. 


(a) 2-phase synchronous-machine field structure rotating. 
(b) Primitive machine with armature rotating. 


Generated electrical torque. 


Fixe if ie <5  ee) 
where ig = Mi Di we ag (9) 
and fp SL LE ee OT) 


In eqns. (3) and (4) the quadrature field axis has been omitted 
for simplicity. 

Equivalent circuits for the synchronous machine have been 
developed by Kron!> and studied by Ku** by resolving the set 
of eqns. (1) into forward- and backward-rotating instantaneous 
symmetrical components. 

Eqns. (1) are of the form 


v = Ri+ Lpi + Gipd ere tee ame) 
or Di= 7 i Re ee ee) 
where Z=(R+L1p+Gp0 ... . (0) 


The torque is given by 
f= G- ol oes ae ee eee CU) 


(the asterisk denoting conjugate values), the ‘torque matrix’ G 
being 


ds dr qr qs 


(12) 


can be transformed to give that for any other commutator 
machine, by the simple transformation 


Z! =£j12nO een 


where C is the matrix connecting mesh currents in the derived 
machine windings with the branch currents of the primitive 
machine coils. If the transformation is to be carried out to 
any reference axes which are rotating relative to the direct and 
quadrature axes, then, as shown in Reference 3, the transforma- 
tion law is 


oC 
Z’ = Cw. Z.C + Cay. L. soph - 5 . (14) 
where p0’ is the angular velocity of the reference frame with 


respect to the direct and quadrature axes system. This leads to 
an equation of the form 


v = Ri’ + L’pi’ + Gi'p0 + Vip’ «ws sO) 
or aw Ae Ae ea eee UC 
where Z =(R'4+L’p + Gpd+ Vp). . . (17) 


If the direct- and quadrature-axis quantities are transformed to 
uniformly rotating axes S; and S,, the connection matrix C is 
given by the relation between the currents in the two systems as 
shown in Fig. 1. 


j¢s — jds 

i#” = {51 cos &” — iS? sin 0” 

iv = iS! sin @ + iS2 cos 6’ 

[Oo s= BE Pa ee are (GIG 
and dr S; S> qs 


(19) 


The terms of expression (17) are given in full in Reference 5. 

If the angle 0” is, in fact, the load angle A, the axes S, and S, 
coincide with the voltage axes of the machine. The angle A is 
constant in the steady state and the last term in eqn. (15) is zero. 
The impedance matrix is then given by the transformation (13), 
which gives (neglecting axis qs) 


It is shown in Reference 3 that the impedance matrix of eqn. (1) Zi = Cy. ZG.) ee (20) 
ds Si S> 
Yas + Lasp M, cos Ap 
M,,cos Ap Var + Lay cos” Ap (Lj, — Lg,) sin A cos Ap 
ae —Mz, sin \p6 + Ly, sin* Ap + Lg, sin? Ap9 
aa +(Lqr — Lar) sin A cos ApO + Ly, cos* Ap erry el 


—M, sin Ap (Lap — 
—M, cos Apé — Ly, cos* Apé + Ly sin? Ap 
— Ly, sin? ApO —(Ly, — Lg,) cos A sin Ap 


Lz) sin A cos Ap Tapaiad en COS On 
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The matrix multiplication shown above, to give a change of 
reference frame, is extremely simple, and the obvious question 
that arises is whether any knowledge of tensors is required and 
whether there is any advantage in learning a new mathematical 
technique involving, among other things, a complicated index 
notation. The answer begins to appear when one looks at the 
general law of transformation of impedance during small 
disturbances of the machine voltages, currents and speed.> If 
the hunting impedance in Park’s reference system is &, the 
hunting impedance matrix in a free frame is given by 


dC 
£’ = ie eC Oly. so |i + 6’) 


dC. dC oc 
dCq) Bee) Aa 


The purpose of tensor analysis is to present all the trans- 
formations of machine reference frames in a consistent dynamical 
theory for steady-state and hunting analysis of all types of 
machines. The fundamental ideas underlying the tensor 
approach as developed by Kron have been examined in 
Reference 5. The salient points are now surmmarized. 


(2.2) General Tensor Equations of Electrical Machines 


In Reference 24 the different concepts of flux linkage and 
generated voltages arising under different transformations of 
machine reference axes have been classified. These components 
change with the angular velocity of the reference frames and 
‘some arise in one system of measurement and disappear in 
others. When machine equations are expressed in tensor form 
these various voltage components are part of a total ‘tensor’ 
voltage which includes both flux linkage and flux density. A 
tensor voltage cannot disappear under any transformation, and 
it is this, whatever its components, that is associated with the 
invariant power and stored electromagnetic energy in the 
machine. Resistance drop is also a tensor voltage. 

Another advantage of using tensor equations is that there are 
available general routine laws covering all possible transforma- 
tions of reference axes, the machine power remaining invariant 
with each change of reference system. 

The tensor equation of the rotating electrical machine is an 
equation which has the same basic terms for all machines, these 
having different components for each machine or with each 
reference system chosen. The motion of the rotor is included 
in the form of rate of change of self and mutual coupling of the 
rotor and stator coils with changing rotor angle. 

The voltage equation of a single coil having resistance and 
inductance is written 


Sts oreast ont a2) 


v=Ri+L 


The equation for a set of coils, some of which rotate with respect 
to the others, becomes 
éi4 “ 


UN = Ryul” ar Liz ° (24) 


hed . . . 

where as is the ‘absolute’ derivative of the current with respect 
t . 

to time. Every term is then a tensor. As shown in Reference 5, 


di di# 9 
tie ST 


oS De) 
dt dt e) 
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where I°¥,, contains functions of the matrix C which relates the 
currents in two different systems, 


it = C¥ie (26) 
Ol oe. Ort 
d ee ere va 
ta di dic GY 


The indices range over the different machine variables, namely 
the electrical charges in each coil and the angle of rotation of 
the rotor. The coil currents and angular velocity of the rotor 
are then written; for example, 


Patag" 
do 
d P= = = EE Me i Batt 4 ) 
an is, 0 (29) 
The index s is used to denote the mechanical variable, the angle 0. 
Also oe — aie sl ae whit & (30) 


The machine equation in its general form therefore becomes 


Ow = Ryul® + Lyypi® + Te wht? (31) 


The whole of tensor analysis of rotating electrical machinery 
is based on a knowledge of the components of the term I’, yi? 
in eqn. (31). When these are understood and facility in mani- 
pulating the index notation has been attained, the group proper- 
ties of electrical machines become clear and analysis of a wide 
range of machines under many different operating conditions 
becomes a matter of routine procedure. The study of the I’ 
terms and their expansion have been carried out in detail in 
Reference 5. 


(2.3) Synchronous-Machine Systems 


The analysis of synchronous machines can be carried out 
using any one of three reference systems shown in Fig. 1, 
namely 


(a) Actual phase quantities, or 2-phase co-ordinates of these. 

(b) The reference frame of Park. This is much more suitable for 
most cases, and is in general use for synchronous-machine studies 
at constant speed. 

(c) The free frame. In this system the field quantities are referred 
to axes fixed on the field as in reference systems (a) and (b). Arma- 
ture quantities are referred to axes which rotate uniformly with 
respect to the armature as in reference frame 2, but these can be 
chosen to have any uniform velocity, they can have any fixed 
position relative to the field structure, and the angular velocity of 
the reference frame is independent of any oscillations of the field 
structure. The simplest case is that in which the free frame coincides 
with that of Park under steady-state constant-speed conditions. 


The machine equations in each frame have the same form and 
are identified by different systems of indices as follows: 


Frame (a) 0,= Rigit + Lope + Va ee (32) 
Frame (b) Ui, = Kgl a Lippi L pont oe (33) 
Frame (c) Oy se Regk te Lee Dl? tL ob, yt Ee (34) 
where en — (Cady (35) 
f Bp hi OG (36) 

etc. 


The most general form of the term I’, in eqn. (31) is, as 
shown in Reference 5, 


—1/0L yy, Lyn 
Pur, = 5 Oxy - OX, 


a Swou re Suv aE Suvw Sip ea wr One 


) 


$< X24, w (37) 
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In the voltage equation 


x’ =6 and iv =p0. (38) 
In the steady state 
Olan “ud 
Di wit’ ES (<2 cores 2S ee a 20 n,n) Ll - (39) 
— Sou = Cy. G.C=G’ (40) 
and the torque matrix in the new frame is 
Dg oy 41) 
OG Fhe = 00’ Cyl — el ° ( 
where L’ is the inductance matrix in the new frame 
wu 
and e= (42) 
Thus [’,,, ,i“i” becomes, in the steady-state equations, 
OL’ sf ; 
_. pO’ .i’ + G’.i’. 90+ pL’. i’. pO’ . (43) 


00 
where p@ is the angular velocity of the rotor and p@’ is the angular 
velocity of the reference frame with respect to the rotor. 

In reference frame (a) the second and third terms of expres- 
sion (43) do not arise. They appear only in transformations 
from reference frame (a) to any other reference frame. 

In reference frame (b) the first and third terms are zero. 

In reference frame (c) all three terms arise. An important 
point is that, while inductances along reference frame (c) can be 
constant in the steady state (if the axes rotate at synchronous 
speed or coincide with Park’s axes), they are subject to incre- 
mental changes when the field structure oscillates. This means 
that a term such as (0L’/00)p6’, which arises in the general 
equations, may be zero in the steady state, but under hunting 
conditions it becomes (0L’/dA)p(AA) and must be retained. 
The use of the general form of the term [’,,.,, in hunting equa- 
tions ensures that all possible interactions of currents, fluxes and 
speed, together with their increments, are included in the final 
equations. 

The disadvantage of Park’s equations [frame (5)] for hunting 
analysis is that the armature reference frame is fixed along the 
field structure and oscillates with it. This means that circuits 
connected to the armature must also be analysed along the same 
oscillating reference system.!? It is simpler and more realistic 
to analyse the machine in the first place along reference frame (c). 
External networks can then be more easily included. It is 
reference frame (c) that leads to the new equivalent circuit in 
which the system damping torque is given by the network mesh- 
resistance power loss. 


ds dr 


Map 
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(3) SYNCHRONOUS-MACHINE HUNTING EQUATIONS. 
PARK’S REFERENCE FRAME 
The machine is that shown in Fig. 1, with armature axes dr 
and qr, stationary with respect to the field. The quadrature field 
circuit has been omitted in the following equations. The 
resistance, inductance, torque and impedance matrices are as 


follows: : 
v = Ri+ Lpi + Gipd 


or v=Z.i 
where ds dr qr 
Yas + LgsP Map 
Za Map pre eee (44) 
V4, p80 —L pp 
ds ie ds 
Resear 
qr . (46) 
(47) 


The hunting equation may be found as in dynamics by con- 
sidering small increments of the quantities in the steady-state 
equations (8) and (11). 


Thus (v + Av) =(Z+ AZ) + Ai) (48) 
or (v + Av) =(R+ ARG + Aid + (ZL + AL)pG + Ai) 
+ (G+ AG)G + Ai)(p8 + Apé) (49) 


Multiplying out and subtracting the original equation the 
hunting equation of voltage is found: 


Av =(R + Lp + Gp®)Ai + Gi(Ap®) . . (50) 
The torque equation becomes 
Af=i*.G. Ai + Ai*.G.i (51) 


The matrix form of eqns. (50) and (51) can now be written, 
including a mechanical row and column involving the mechanical 
variables. 

The equation has the form 


Av == gs . Ai . . . ° 
which, when written in full, expands to 


(52) 


qr Ss 


Var LiyP 


ae Lap 6 


2i7 Lp 


~ 1.3m 
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\ where Lp => (Li, aS 1b) 


(57) 


‘and J is the rotor inertia constant. 
These equations have been studied in detail both analytically 
(and by means of equivalent circuits,® 22 and they will now be 


used only asa starting-point for the study of the hunting equa- 
' tions expressed in the free reference frame. 


Aids 
Aid’ 
ia’ 
AA 


ds 
d’ 

q’ 
s 


. 


S D> Bz 
Saks eee ae 
ee eS SS eg ee as 
Pasa Prete ae Se 0) Se ne 
(4) HUNTING EQUATIONS IN THE FREE FRAME as |G? aG%e oa ate lees 
; : = 3 xX == 
In the free frame the reference axes do not oscillate with the 22 Sx = = cl & e es 2 is a es 2 = is = e 
‘Totor. The equations are first expressed along the axes d’ and ,, | -& 818% 28% 2 8% %S8a%] 8S e494 
q of Fig. 2, rotating synchronously with the terminal voltage Ss Sos a 3S ae Sect fs 23 33 
vector position. Naku a a a 
y | | iaTias oo ie +3 33 | 
: See ee |S alee 
Sete Sea ipl mee! 
a+ A & QA & Qs SS 
Sa alas Loelie abi eS 
ar = ar Nl ta 
8, 
: R BSH 
s 2 De 
a 2) 
&, 8a be a | ps 
as xa Se res Wee = 
q qaa Onaeo 6a B 
— B ARES ete o 8 AN 
S ann ss Ses 
Bey We BS 
| SY ae a re 
| ails ad, tet 
Fig. 2.—Free reference axes. leateect: a | Sie 
Axes d’ and q’ along terminal voltage axes. & er ey | rl} 
Axes d and q coinciding with Park’s axes. =) ~ KN 
| w+ 
This may be done in either of two ways: | 
(i) The hunting equations (53) can be transformed directly to a 
those in the free frame using the law of transformation given by A 
eqn. (22). In this equation the p-operator in & refers only to z a = 
Ai’ or AQ’ and not to C; and in Z, prefers only to Cand i’ and not oS = | c 
to AO’. & 8 2 2 
(ii) The more elegant method is to set up the steady-state equations &, gy & i ORs Ai ic 
in the new frame and from these to derive the hunting equation. os g oar: Pp ~~ dias 
: : : ; 7 fe) Sch ae2y ZN Sivas 
Fhe steady-state equation is eqn. (34). Taking small incre- = ey ees a si S Ze 
ments of this as before, the hunting equations, as shown in = ab a F i) 35 z S| 
detail in Appendix 11.1, are as follows: =e es Laie poise 
wv > We 
: Nee — & o~ i 
Voltage equation. 3b = | 
Wa | 
pecs 6 — 
Ne, + re 
(RopAi® + LogpAi® + Tg, .p8Ai® + Py, .pOAi) 
‘ ols Xe 
| Dgs,o8°p + Ds, ai%P + 5 =i" pb — eS & 
a na Ss & S (=| 6 
ie OL 8 ae 3 eh ed Zio 
Ree ey,) a8 7.6 AX ft SS me 
i ++ 1 . (54 z 3 
any etl yee (54) 8 $5 = SE 
which becomes, for the synchronous machine, 
; OL 0G’. 
Av’ = (R’ + L’p + GpO)Ai’ 4 (5? on! pO) Ar 2(335) 


the torque equation is 


, 


a a 
“a os ey os G’ ai + 2S ese 87 Ar see | 2A 

maa Ni *.G.i +i *.G .Al ax ip E S > z 

yn Meena ALG <I = = = 

<i » 


+n matrix form these expand to 


426 


These equations are very much simplified when the free axes 
are considered to coincide with Park’s reference axes In the 
steady state. The angle A then becomes zero. This does not 
imply that the machine load angle is zero, but only that the 
reference frame has been rotated. The load angle A is still 
inherent in the equations in the computation of the steady-state 
currents involved in the hunting equations. 


ae : Oven A 
In the free frame along the field axes the machine-impedance Svy = Ryadi® + Lye (8i%) + Kyroyitibdx™ 
matrix becomes 
ds d q Ss 
dee a eee eee 
=, i 
ds tage Loge Map Maitp 
(Lor = Ly)ilp 
ie Te 00, ie 
d Map Tp or drP q Mp 
Uke re Lap itp 9 (58) 
So = (Fb pera L itp 
q —M,pé grDO lr, a Lap ( ” Mai 
Jp* + M,i*i4 
; M,i# 7d 
M,i4 —(L,, — Lzg,)i4 a ; —(L,, — Ly,)iti 
AY dl ( q dr —(Lyy et Lait (re i L,)iti4 


(5) THE TENSOR EQUATIONS OF HUNTING 


The time rate of change of a vector with respect to axes 
fixed on the vector is written di*/dt, where the components of 
the vector in the given reference frame are i*. With respect to 
another co-ordinate system, e.g. one that rotates with respect 
to the original vector position, the time rate of change becomes 
the absolute derivative 6i%/dt, where, in general terms, 


di di Gea AE 
id | nC) 
a ede Dei dt (69) 
The absolute differential is written 
di% = dit + D4,iPdxv (61) 


In setting up the conventional equations of hunting, small 
changes Ai*, etc., were considered in each of the terms of the 
steady-state equations. No consideration was given as to 
whether the resulting equations were tensor equations. In fact, 
as stated in Section 2.2, in general the ordinary differential of a 
tensor is not a tensor. The absolute differential shown in 
eqn. (61) is a tensor. The tensor equation of hunting has been 
developed by Kron in Reference 2, by taking absolute increments 
di%, etc., in each of the terms of the tensor steady-state equation. 

Taking absolute increments of the steady-state eqn. (34) gives 


Oy 4 Av.) = (Rya 4 ORya)(i* els Baie Cio a3 1y8(2) 


dt 
(62) 
and the tensor equation of small oscillation becomes 
, 51% 
vy = Rodi + LwS(—) . (63) 


It is now necessary to express this equation in terms of 5i%. 
This change introduces a new tensor which in geometry is called 
the Riemannian—Christoffel curvature tensor, because it gives a 
measure of the intrinsic curvature of any given space in 
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Riemannian geometry. It arises from the fact that, as shown 


in Appendix 11.2, 


Sit ee Sis i 
(82) — Bom —aagenan | 


where the term on the right-hand side is the new tensor. 
Eqn. (63) now becomes 


(65) | 


Appendix 11.2 shows how this equation is expanded to give the 
machine voltage and torque equations of hunting in tensor form. | 
These are written by Kron in Reference 19 as follows: 


Voltage equation. 


Sv, = fed? + Lan 6h + Kpecyi(pO)AX (66) — 


or 


Av’ = {te + L’p + G’p0\Ai’ + Si (2) +G’. pi’. pO. aa} 


0G’ 
meee yA J LTA 67 
+(% G ei | 0 AA (67) 
Torque equation (neglecting the friction tensor Ryqi%). 
of = {! 5 on} + KejasiSi@AA (68) 


i=: {7 f (dw) 


= [AIA GUE G Bi i eG aor anit 


ic [i -. anh a Ga ei [aa 


The first set of terms in square brackets in eqn. (69) is part of 
the second term on the right-hand side of eqn. (65). It gives a 
complex quantity. The real part is in phase with the increment 
of angular velocity Aw, and is therefore in time quadrature with 
the displacement angle AA and gives the damping torque. The 
imaginary part is in anti-phase with the displacement angle, — 
and is counted a negative synchronizing torque. The second 
square-bracketed set of terms in eqn. (69), which is given by the 
new tensor term, is seen on inspection of its components to 
have only a positive real value, in phase with the displacement 


(69) 
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angle. It gives the machine positive synchronizing torque. 
The matrix components of these equations are examined in the 
following Section. Eqns. (67) and (69) are seen to consist of 
the non-tensor equations (55) and (56) with a term added and 
subtracted. Thus the tensor equations of hunting give the 
conventional equations, with an important difference, namely a 
regrouping of terms which leads to a change in interpretation.” !9 
The mathematical implication of the regrouping, from non- 
tensor to tensor form, has been explained by Hoffmann.25 


(6) EQUIVALENT CIRCUIT FOR HUNTING EQUATIONS IN 
THE FREE REFERENCE FRAME 

Equivalent circuits for a.c. electrical machines can be obtained 
by operating upon the impedance matrices in such a way as to 
make them symmetrical.4:2_ The primitive machine is an 
equivalent 2-phase machine, and resolution of the direct and 
quadrature quantities into 2-phase symmetrical co-ordinates 
leads to the required symmetry of the impedance matrix. 

In the free frame the transformation is 


jas — jds 
it = (if + P)f/2 
sd ES eat) IN Le 


(70) 


jas ae jas 
if = (i4 + ji J4/2 
ib = (#4 — ji [/2 


and 


(71) 


and LOS hie CO 
v’ 


LR lle , 
vw’ = Cy. 


= Oi 


(72) 


oe 


the asterisk denoting conjugate values. 


ds if b s 


ds | »/2 


(73) 


The impedance matrix (59) now becomes 


ds f 


1 
Vas + Lasp Aviat dP a) 


1 


Up ry, ol Ls(p — jp?) 


My(p — ip) 


titi 


Ly(p — ipo) 


427 
Ly, —L 
where L.= ate and Lp = oe 
In this matrix, bp = (ba + jb l/2 } 75) 
by = (bg — jb) lV 2 
where b, and 5, are defined from matrix (58): 
by es By 5 i9 
Og d d (76) 
(Na od By 
ee (7) 
B, = Lat + M,i% 
Wal ae 4 
By =.(By + iB, IV2 =i(FyMai4 + #Ls + PLp) 
1 (78) 
By = (By — iB) = —i( FyMai® + Lp + Ls) 
and bp = — (By — ifjLs + PiLp) 
1 Laer 
= -i( ane 2iLy) (79) 
b, = — (B, + PL — iF{Lp) 
Sa eee ng. Se an 
=i( Mai S + 2i Lp) . ( ) 


The additional term in eqn. (67) which is added and subtracted is 

G’. ei’. pO. AA (81) 
This set of terms arises only in the last column of the voltage 
part of matrix (74). Thus the last column of this part of the 
matrix can be written 


ut 


v7 0G 7 ur e// 
OL i’’p(AA) ae {Ie’. ei’) = gE —G 6 Q2 }}reaa . (82) 


oA 
where G” is now the ‘symmetrical component’ form of the 


matrix G. The terms of expression (82) in square brackets 
expand in matrix form to 


ds if b 


ine : ; 
imal! — *) 


by(p — jp?) 
(74) 


Lp(p + jp?) 


1 é 
yale + jp?) 


r, + Ls(p + jp9) 


by(p + jp?) 


by —by 


Vou. 105, Part C. 


Jp” =e jb,i? 
—jb,if 
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ds iP b 


(84) 


(85) 


Matrix (85) given by the tensor K is seen to comprise the 
quantities By and B,, and expression (82) therefore divides the 
quantities br and b, into two significant parts: 


by = — Bp + (iffLs —i?jLp) . 


by = — B, — @jLs —ifLp) . 


(86) 
(87) 


The equivalent circuit can now be drawn for steady hunting 
conditions at the hunting frequency hw. This is shown in Fig. 3. 


gs! 


Fig. 3.—Equivalent circuit for salient-pole synchronous-machine 
hunting. 


The addition of injected voltages and currents is indicated a priori 
by the grouping of terms shown in eqns. (86) and (87). 

When the circuit is drawn in this manner the effects of incre- 
ments of current Ai and the absolute changes i become 
apparent.!? In the symmetrical component form, 


di = Ait + Teatir (88) 
becomes dif = Aif — jifAd (89) 
and 672 = Ai® + jiPAX (90) 


(These are the ‘absolute changes’ used by Ku22.) 
The significance of active and reactive power in the equivalent 
circuit is discussed in the following Section. 
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(7) THE TORQUE EQUATION AND THE EQUIVALENT 
CIRCUIT 
The machine torque equation can be written down from 
matrix (74), using the grouping of terms indicated by the tensor 
equation of torque [eqn. (69)], 


Af = {| - Mee — i)Aite* — by il* — mnie | 


+ [i’*.G%e. ivan} 


JG” 


a [ee , ry “ i” Bal i’* j Ge J i |an (91) 


Examination of the equivalent circuit meshes shows that the 
active and reactive power measured at the points indicated, 
namely AiV, give real and imaginary parts of the first set of 
terms in square brackets in eqn. (91), with a time-quadrature 
difference. Under steady hunting conditions the total torque 
expression can be written, as in Park’s reference frame, by 


Af = (Tsynch + JT damp) XA a 


and the real component of power as measured from the equiva- 
lent circuit of Fig. 3, namely the resistance loss in each mesh, 
corresponds to the damping torque given by eqn. (92). The 
imaginary part gives the corresponding component of (negative) 
synchronizing torque. 

The additional components of negative and positive synchro- | | 
nizing torque are given by the remaining two sets of terms in 
square brackets in eqn. (91). Components of these terms are 
included in the equivalent circuit, but they cannot be read off 
the circuit directly since the injected voltages and currents are | 
already associated with the displacement angle AA. Fig. 4 shows 


I iv; 


(92) 


Fig. 4.—Contribution of forward armature mesh to synchronizing 
torque. 


ifV;’ gives positive (impressed) synchronizing torque. 
ifV;”’ gives negative synchronizing torque. 


how the circuit can be interpreted to indicate the contribution of — 
each mesh to synchronizing torque. 
Positive synchronizing torque given by the tensor K expands to 


(— Brif + Byi®)AX . (93) 
This can be compared with the machine power output equation?2 
— Bri? + Byif (94) 


and it is seen that the positive synchronizing torque at any load | 
angle is, in fact, given by the reactive component of the machine | 
power output. The simple equation for synchronizing power of | 
a round-rotor machine given in textbooks*® is usually written q 


e cos A)AA H 


and the expression in brackets is again the reactive component 
of the steady-state vector power output. 


(95) 
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(8) CONCLUSION 


The equations of performance of any conventional electrical 
machine can be derived by an automatic tensor transformation of 
those of the primitive machine. A similar type of transforma- 
tion can be used to express the equations of a given machine in 
any one of several sets of reference axes. The transformation of 
Park’s equations for the synchronous machine to those in Kron’s 
freely rotating axes leads to simple overall equations for an inter- 
connected system. 

The complex interaction of currents and fluxes in a machine 
during hunting can be more easily followed in different reference 
systems when the corresponding transformation of equations is 
carried out using matrices and the routine methods of tensor 
analysis. The tensor form of the equations of hunting of a 
machine in the free frame has two advantages: 


(a) Synchronizing torque terms are inherently grouped together 
in terms of the angle of oscillation AA. Positive and negative 
damping terms are inherently grouped in terms of the increment 
of angular velocity Aw and give this component of torque as the 
real part of a complex expression. Damping torque is thus directly 
associated with the resistances in the electrical system. 

(6) The terms of the tensor equations in symmetrical component 
form give, directly, the meshes in an equivalent circuit. This form 
of equivalent circuit can be interconnected with corresponding 
circuits for an external network and additional machines. The 
resistance power loss in each mesh gives the damping torque con- 
tributed by that part of the system. 


The significance of tensor groups of terms in the steady-state 
equations of electrical machines has already been noted.> The 
tensor form of the hunting equations gives a physical picture of 
the hunting phenomena as represented by the equivalent circuit. 
It would appear that this method of formulating dynamical 
equations could be used to advantage in the analogue study of 

‘complex devices in which interchange of different forms of 
energy takes place, for example, aircraft, missiles and nuclear 
reactors. 
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(1) APPENDIX 
(11.1) Small Oscillation Equation 
The free-frame steady-state equation is 
—— Ragi® + Lugpi® + Vpy,al"i¥ ‘ 


This is divided into voltage and torque equations. 


(96) 
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Voltage equation. 

The index « is electrical. The indices 6 and y are electrical, 
indicating currents i® or iY, or mechanical values s, indicating 
angular velocity i’ = pé. 


i Ragi” + Lap pi? + Bee arty + Tes, a8%p0 (97) 
Torque equation. 
The index « takes the mechanical part of the range. The 
indices 8 and y have electrical values. 
V_ = RysiS + Lygpis + Vey, si®iv (98) 
or Us = RyspO + Jp?0 + Vey, siPir (99) 


The small-oscillation equation is obtained by taking small incre- 
ments of values in eqn. (96). This gives 


) Q 6 
Avy + she = RypAi® + Lypp(Ai®) + ALggpit 


+ Tey of Aiv + Ppy eAiPir + AT py, of®i¥ (100) 


The corresponding voltage and torque equations are as follows: 
Voltage equation. 

ow 
Avy, + she = RypAi® + Logp(Ai®) + DyyaA(p)i¥ 


ats Vee awAi®p8 ae te Ait pO sie Ve,, ol A(pA) 


ae 


+ Bp +2 sir’ sate “Api ryAxt + 8 78(pO)Axt . (101) 


The index ¢ denotes excursions of the rotor over the increment 
of speed, 
A(p8) = p(Aé) = p(Ad) 


The terms of the voltage equation are expanded in a manner 
indicated by the following example:> 


Ded VAit = 
{Ity, ot] — Scere — Say + Spyce + Query, + Quit, 
— {[ty,0] — Saye Oag piv¥Axt 


Other terms are zero as shown in Sections 4 and 5 of Reference 5. 


— Oy obit Ait 
(102) 


Now — Soy = G = Cy.G.C (103) 


where G’ is the free-frame torque matrix and G is the torque 
matrix in Park’s equations. 


20 eee ar 
ay = ga Cg) Eo = EL’ = 28x, (104) 
1/0L OL OL 1 OL 
CCM Pen Caernca ene BU 
se ity,@] IN Ope Ox’ Ox 2D Ose Oy 
The index s denotes the mechanical variable 0, and 
oD 6-6, or fe Ot 7h Oks 
= eetiitAe = (5 2° — Sue 4 Q,.,9)#i@Ax® (106) 
Lup af Lap 
x 0 pd =0 (107) 
5,885 = ea Ca -L =p) =0 (108) 
08 
Sasgii =4G'p0 (109) 
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OLS nce. OD G5,0..5.6 t / 
SYK og S/H rs = ———F : A 110 
Thus ag re iv¥Axt ae Ax mn! pd. Ad . (110) 
Therefore eqn. (101) becomes 
Av’ + San = = [R’ + Lp + G’p0)]Ai’ 
OL dG’ 
i’ 111 
+| Sin + 27 90 aa (111) 
Torque equation. 
AF = JpXAX) + Voy i®Ait + Day AiBiv + Ober) 
(112) 
Dey,siPAit = {[8y.5] — Sera — Sear + Sprs 
+ Oe + Ose = Ory, PAT (113) 
Heyes Ole 
[By,siPAiy = 5 30! Ai’ =0 (114) 
Gem Neher er eh usin ene Mi 
(Qyy8 + 2:67 — By,s)a iv = jon Cw eile Ni 0 
(115) 
(— Sse — Ssey + Seys)i@Ai¥ becomes? . (116) 
(= Save — Syd? 4.01 G20 Ais 4G) a 
Thus 
Poy, st Ar + Tey APit = Ai Gr =a Gr eee) 
or ; 
= SiiVAxt = i’ ae , Pane (119) 


Therefore eqn. (98) becomes (neglecting the friction term rn) 


Af = Ai’*.G’.i +i'*.G’. Ai’ + pene i’. AX + Jp(AA) 


OA 
(120) 
(11.2) The Tensor K3,¢q 
The absolute differential of a contravariant vector is 
dd' = df + Th pkdxi (121) 
Ores OD. . ae 
Therefore ACL = FAG + Tiikdx/) (122) 
= dain + Diikdxi) + D4, (dim + Dm irdxa ee 
wien bg haere | Peas ae 
dV 
=“ ait at ae Rdxi + 12 oe ax! Te i (ax!) 
Sa MF ed RET (123) 
iat di® di® 
Similarly 8(— = a(— Tea Pes (124) 
_a(e T8sivj8 Badivi ; 
= a(F) +d NS a DSsdivi8 + DSsivdxd 
+ ese dx? + TS Dr qitiBdxs (125) 
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Therefore 3(% ee FAG 2) 


ORs. ole 
See = ao idx? ae ie al Se i%iPdx™ 


5 
— VB.DAiniedx® + 1% d (ae = “ad |e . (126) 


The bracketed difference in expression (126) is not zero because 
the electrical variables are non-holonomic, i.e. they are related 


only through non-integrable differentials,° and 
02x5 02x5 
OXYOX® ~ YxPdx¥ * (2p) 
This is shown in Reference 27, as follows: 
oP 2 oP Ey MLSs oP sO} oP 
OxdxI = OxF Ox? J Ox Ix Ox Ix* 
oP "OCP ROG 
= jg Peis Ct) 
h A 
oP crc? oC}. ni a 
~ xh Oxi’ Oe 
oP oP 
= Sy 2Q4 = — $5, 204 (128) 
Oe eee 
where OF = 4c/c1(S - 54) (129) 
and in expression (126) 
rea ie ¥ Sax | = —T8yr 202 iraxe (130) 
With appropriate rearrangement of indices 
KY pidi¢dx = 3(= =) = cei i) (131) 
where 
OVS ore ee ot 
S78 = o a raat l ear ~ Dsl Sy ad 21H Q8, (132) 


The tensor equation of small oscillations now becomes, as 
given by Kron,!8 


Dog a 
Av, St bz, ae 5( Rat") = Leo (8i*) Sin Ksypal?iedx¥ (133) 


which expands, as shown in References 2 and 18, giving 


ov, di¥ ; di* 
Avy + eh? =— Dy3aGAe + RapAi® + Dyaa ZA 


oLap di® 
Ox’ dt 


+ Lap (Ai) — Axt + DoyaAiit + DoyaitAir 


— Try, «I's — 21 pa, G2’, ) PBAXY 


or 
= ( — ax Doaol Sy 


} Dove 
oxd 


Olan 
F (2 the 8 bet + TeV Bs — Pinal hy + 20 ono iO 
(134) 


This is seen to be the same as the conventional equation with the 
irae terms added and subtracted: 


or 
Dey & Ne -- (Tas, ol Ay = 21s, AOS, = = *) an (S5) 


In the synchronous-machine equations in the free frame the 
steady-state current is constant and 


A aT = (0) (136) 
BY, see 
The remaining terms expand in the voltage equation to 
YBa 
(Deeks aa 21's, oh dh, == a ibis 
1 (Tra oP ab + 205 OR a oe 28) iA (137) 


The index y takes the value ¢ indicating the mechanical variable 
which undergoes incremental changes, namely the load angle A. 
The index s as usual denotes the holonomic variable, the angular 
position 6, of the rotor. Only the first and fifth terms have 
non-zero values. The additional terms in the voltage equation 
are therefore 


(Pal ® 2 aOR esAn (138) 
= (Ths + Ds,0)phi®pOAr (139) 
= G’. pi’pOAA . (140) 
The corresponding term in the torque equation is 
(Tha,s + Der,s)psititAa . (141) 
=—i’.G’.pi’. AA (142) 


The small oscillation equations are therefore as given by Kron 
in Reference 19, namely 


Voltage equation. 


Av’ = [R’ + L’p + Gp8)Ai’ 


0G’, ? 
= pe —G’. 


. ei’. p8. AA 


e | POCA... (G67) 
Torque equation. 


Af = (De) [Ait Gera tA4G Ai sie Georan 


4 | ye dG’ 

ar) 

where AA is the displacement angle and Aw = p(Ad) is the 
angular velocity of displacement. 


G’. er |aa . (69) 
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SUMMARY 


The use of elliptical coupling apertures between two rectangular 
parallel waveguides for the realization of intrinsic directional couplers 
is investigated. It is found that both codirectional and contradirec- 
tional characteristics can be obtained. The possibilities of using the 
intrinsic directional property to improve the characteristics of multi- 
element couplers or to construct filter couplers are examined. 


LIST OF PRINCIPAL SYMBOLS 


x, y, z; a, b, d, p,q = Co-ordinates and dimensions relating to 
waveguides and coupling apertures, 
explained in Fig. 1. 

€ = Eccentricity of the elliptical aperture. 
Ao = Free-space wavelength. 
A, = Guide wavelength (Hp;-mode). 
A, = Cut-off wavelength (Hp;-mode). 
Aom = Wavelength for maximum directivity. 
k = Free-space wave number. 
E = Electric field strength. 
H = Magnetic field strength. 

E, = Peak value of the electric field in the 
secondary guide travelling in the positive 
z-direction. 

Ep = Peak value of the electric field in the 
secondary guide travelling in the nega- 
tive z-direction. 

P = Electric polarizability of the coupling 


aperture. 

M = Magnetic polarizability of the coupling 
aperture. 

p = Directivity, defined by the ratio of 
E,|Ep. 


n = Subscript denoting direction normal to the 
plane of the coupling aperture. 

! = Subscript denoting direction in the plane of 
the aperture, normal to the direction of 
propagation. 

E, F = Complete elliptic integrals of the first and 
second kinds respectively. 
f, = Bandwidth of the directional coupler. 


(1) INTRODUCTION 


Directional coupling between two waveguides may arise as a 
result of interference effects between several coupling apertures 
(interference coupler), or may be the consequence of an inherent 
asymmetry in the coupling provided by a single aperture 
(intrinsic coupler). It is possible to combine these two principles 
in a single coupler by using a series of interfering apertures, 
each one of which is itself inherently directional. Such a coupler 
can, for a given length, be designed to give a higher directivity 
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or maintain a specified directivity over a larger bandwidth, than 
either the purely intrinsic or purely interference type. It does 
not appear that such a coupler has been described, although the 
principle is of course frequently used in connection with antenna 
arrays. Another possibility is to use the intrinsic directional 
properties to couple a signal in one direction at one frequency, 
and to use the interference properties to couple a signal in the 
opposite direction at a different frequency. In this way, two 
frequency bands can be extracted from one waveguide and sent 
along two different paths, using only a single coupler. This is a 
type of ‘filter coupler’, examples of which, using microwave strip 
line, have recently been described.! 

The first requirement for the practical realization of such 
devices is an intrinsic directional coupler, in which the two guides 
are parallel. Now, interference couplers are necessarily of the 
forward type, in that the signal in the secondary guide is travelling 
in the same direction as that in the primary guide (codirectional 
coupler). Thus to realize a coupler with enhanced directivity 
or bandwidth, we must use a coupling element intrinsically 
directional in the forward sense, whereas the filter coupler will 
require an element directional in the backward sense (contra- 
directional coupler). In rectangular guides, a forward element 
can be obtained by the use of two crossed slots in the broad 
common wall, as shown by Surdin,” or by the use of a separate 
pair of slots, one series and the other shunt, as shown by Riblet 
and Saad.3 Crossed slots can also provide a backward element, 
but in this case it is necessary to offset the two guides laterally.” 
A simpler contradirectional coupler is the well known Bethe 
coupler,* which consists of a single circular hole placed on the 
centre-line of the common broad wall. Unfortunately the 
directional property can be obtained at a single frequency only, 
if the two guides are to remain parallel. It seemed, however, 
that a small modification to the shape of the hole might well 
allow an extension of the use of such a coupler to other fre- 
quencies. In fact, it was found that, if the aperture is made 
elliptical, dimensions can be specified to produce a reverse 
coupler, with theoretically infinite directivity at any given 
frequency above the cut-off frequency of the waveguides. 
Furthermore, if the elliptical hole is moved off the centre-line, 
it is again possible to obtain an intrinsic forward element which, 
by a suitable choice of dimensions, can be made to operate at 
any given frequency. The main part of this paper is devoted to 
an examination, both theoretical and experimental, of some of 
these properties of the elliptical-hole coupling element. A 
calculation illustrating the use of two such elements for the 
realization of a filter coupler is presented in Section 2.4. 


(2) THEORETICAL ANALYSIS 


The general theory of coupling between two guides through a 
small aperture has been developed by Bethe.> For a wave of 
unit amplitude, travelling in the positive z-direction in the 
primary guide, he finds waves of amplitudes E 4 and Ep, 
travelling in the positive and negative z-directions respectively in 
the secondary guide, where, for identical guides supporting the 
same mode,° 
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ik 
EA == (Pep Mp = MB)... *Gl) 
Hers 
Ep = — F(PEX+ MH? — MH?) . . (2) 
with S= | J E, x Hydxdy 


the integration extending over the complete cross-section of the 
guide, where E,,, H,, H, are the magnitudes of the (unperturbed) 
normal mode components at the centre of the aperture, and 
E,, H, are the real magnitudes of the transverse components 
of the fields in the primary guide. 

Eqns. (1) and (2) are valid for apertures which are sufficiently 
small compared with the wavelength, so that the variation of the 
magnitudes and phases of the fields over the aperture can be 
neglected. A further requirement is that the aperture shall be 
sufficiently far from the corners of the guide. For-rectangular 


SECONOARY WAVE 
AMPLITUDE Eq 


SECONDARY WA 
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a fe 
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Fig. 1.—Geometrical parameters of the elliptical-hole coupler. 


guides with an aperture in the broad face as shown in Fig. 1, 
eqns. (1) and (2) become 


ik 

jaye 55PE PAH? = MoH). . ©) 
jk > 

Ex = —5-(PE} + M,H?-M,H2) . . @) 


For an elliptical aperture, the polarizabilities are:’ 


1 
PS4t19¢ ) = 4npq?P’ 
; €2 ; ay 
ee ci | dt 


€2 


Ae 2 el 2nf/ 


M, = 


M, and M, are the components of the magnetic polarizability 
directed along the major and minor axes of the ellipse respec- 
tively. The functions P’, Mj, M are defined by the eqns. (5). 

For rectangular guides supporting Ho;-modes, there is only a 
single field component, H,, at the side walls, so that, as may be 
seen from eqns. (1) and (2), a coupling aperture would show no 
directive properties. On the other hand, for an elliptical hole 


positioned as shown in Fig. 1, 


Ey = R{ P - 0) Mz - (52) cot? (2) m| 6) 
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In these equations M, is equal to M; if the major axis of the 
ellipse is in the x-direction, and is equal to M; if it is in the 
z-direction, and similarly for Mj. The behaviour of the func- 
tions P’, M;, and M; is shown as functions of € in Fig. 2. 


os O-4 OS 6:8 
ECCENTRICITY OF APERTURE, € 


Fig. 2.—The functions P’, M; and M, as a function of e. 


(2.1) Conditions for Infinite Directivity 
The conditions for an ideal contradirectional coupler is that 
Ep should be zero, for a codirectional coupler that E, should 
be zero. From eqns. (6) and (7) it is easily shown that, for 
either of these conditions to be satisfied, 


iD), be Mia 

( My, + M; cot? (md/a) 
where the negative signs apply to the case E, = 0, and the 
positive signs apply to the case Eg =0. The right-hand side 
of eqn. (8) is a function of d and € only. These two variables 


must be restricted to values which make Ag real, and also less 
than the cut-off wavelength, A,. Hence we require that 


NMP. 
M,, + M; cot? (zd/a) 


(8) 


AI =>, 0, eee) 


Inspection of Fig. 2 reveals that the value of P’ never exceeds 
that of either M; or M3. This permits the condition of eqn. (9) 
to be expressed in the form 
P’ = Mi cot? ie (10) 

where the top and bottom inequalities refer to the cases Ey = 0 
and E, = 0 respectively. The limits on the possible choice of 
d and € imposed by eqn. (10) can readily be shown to lead to the 
results presented in Table 1. 

In this Table, the critical value of the eccentricity, €,, satisfies 
the following equation: 


P’ cot? (2) aM: (11) 
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Table 1 
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CONDITIONS FOR CODIRECTIONAL AND CONTRADIRECTIONAL DIRECTIVITY 


Conditions for E4 = 0 


Conditions for Ez 


=0 


Position of hole 


Major axis 
z-oriented 


Minor axis 
z-oriented 


Major axis 
z-oriented 


Minor axis 
z-oriented 


0 < dla < 0-250 
0-250 < dja < 0-304 
0-304 < d/a < 0-500 

dla = 0-500 


impossible 
impossible 
Orage —e] 
arbitrary 


impossible 
pe <a 
arbitrary 
arbitrary 


arbitrary 
arbitrary 
Gi a <I 
impossible 


arbitrary 
O<ex<e@ 
impossible 
impossible 


and the critical value, €5, satisfies 


d , 

P’ cot? (=) = Mi; (12) 

The Table shows the possible choice of € for a given position 

and orientation of the elliptical aperture to make either E, or 

Ep equal to zero. A number of significant facts can be deduced 
from this Table in conjunction with eqn. (9): 


(a) Any pair of values (d, €) will make either Ey or Ep equal 
to zero at some frequency. 

(6) At all frequencies above cut-off, both E, and Ez, can be 
made zero by two suitable (different) choices of the variables 
(d, €). 

(c) No pair of values (d, €) will make E, zero at one fre- 
quency and Ep zero at another. 


The last condition is interesting in that it implies that, in the 
case considered, it is not possible to realize a filter coupler with a 
single aperture. Nevertheless it may well prove possible to 
construct such a coupler with a differently shaped or oriented 
aperture. 

Some computations of eqn. (8) have been carried out for the 
case E, = 0 (contradirectional coupler). These are presented 
in Fig. 3, which shows Ao/A, as a function of ¢, with d/a as a 


02 


O1 0-2 03 O-4 045 Os O7 O-8 O4 Oo 


ECCENTRICITY, € 


Fig. 3.—Wavelength for maximum directivity of the contradirectional 
coupler as a function of the eccentricity of the coupling aperture, 
for a number of values of d/a. 


parameter. It is noted that, for any wavelength and d/a < 0:5, 
there are two values of the eccentricity which can be used. It 
can be shown that the design with the smaller value of € will in 
general be less sensitive to constructional errors in either d or e€. 


(2.2) Bandwidth 


The bandwidth of a directional coupler may be defined in 
terms of Ao,,, Ag; and Ago, the wavelengths at the two points 


where the directivity falls to a specified value p, according to 
the relation 
Xoo 


z= Aor a 


m 


(13) 


The two extreme wavelengths are readily found from eqns. (6) 
and (7). For the contradirectional coupler, 


*0)* (P’ — Mj) + p(P’ + Mj) 
(y [M; cot? (dla) — My] + p[M; cot? (ad/a) + Mj] 
(14) 


where Ay = Ag; for the positive sign, and Ay = Ag for the 
negative sign. 
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Fig. 4.—Percentage bandwidth over which the contradirectional 
coupler maintains a directivity in excess of 26dB, for a number 
of values of d/a. 
The dotted lines are contours of constant Ag/Ac. 


Fig. 4 shows some results computed from eqn. (14), for the 
contradirectional coupler, choosing p = 26dB. It is seen that, 
at a given frequency, a suitable choice of d and «€ allows con- 
siderable latitude in the choice of bandwidth. The dotted lines — 
in Fig. 4 are contours of constant Ay. It can be shown that the 
bandwidth of a Bethe coupler, for a given value of Apo, is given 
by the intersection of the appropriate contour with the dja = 0-5 
line. The elliptical-hole coupler can therefore always be designed 
to give a greater bandwidth than the equivalent Bethe coupler. 
The significance of this fact in practice is in some doubt because, 
for coupling apertures sufficiently large to give coupling losses 
less than 30dB, the theoretical predictions of bandwidth are 
subject to large errors. 
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Table 2 


COUPLING FAcTors, BANDWIDTHS AND WAVELENGTHS FOR MAXIMUM DIRECTIVITY, FOR DIFFERENT HOLES 


; 
Coupli | fi ‘som | ‘Om 
fi Aeoee Theory etter ( a one) ee 

1-35 0-550 | 0-50 oe | c | 
; ; : | 14 Dia3 | 9-50 9-60 9-60 
1:64 0-647 0-50 24 | — | = 9-10 9-27 9-62 
1-49 0-652 0-50 Di | 16:5 | 6:1 9-09 9-24 38, 
1-66 0-730 0:50 22 | 18-5 6-4 8-63 8-84 9-10 
1-88 OsTaT. 0:50 DS | 21-8 | =— 8-22 8-52 9-22 
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(2.3) Coupling Factor 


The coupling factor is numerically equal to Ep, for the con- 
tradirectional coupler, to E, for the codirectional coupler. It 
is proportional to pq? and, for small values of the eccentricity, 
it will be comparable with the coupling factor of a Bethe coupler 
having an aperture of radius (pq’)!/3._ The theoretical value of 
the coupling factor for the particular couplers investigated 
experimentally, calculated from eqn. (7), is indicated in Table 2. 


(2.4) Design of a Filter Coupler 


As one example of the use of an elliptical-aperture contra- 
directional coupler, one may take the simplest type of filter 
coupler with only two apertures. The results of a numerical 
example are shown in Fig. 5. This coupler has been so designed 
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Fig. 5.—Directivity of a coupler with two apertures spaced L apart, 
each intrinsically codirectional at Ag = A2, and together contra- 
directional (interference) at Ao = A}. 

L = dg at Ao =);\ dla = 
L= akg at Ao = Ao S Ag/Ac = 


0-5 

0:5 

that the apertures are separated by $A, at the frequency for 
which E, =0, and by 4A, at the frequency for which 
E,=0. This ensures that there is no deterioration of the 
intrinsic performance as a result of interference effects, but is 
not, of course, a necessary design condition. The performance 
shown in Fig. 5 could be modified to give greater or less band- 
width by the addition of further apertures. 


(3) EXPERIMENTS 


(3.1) Experimental Apparatus and Method 


The main purpose of the experimental work was to gain some 
idea of the limitations of the theory arising out of the assumptions 
made. The experiments were largely confined to the determina- 
tion of the frequency for maximum directivity for the con- 
tradirectional coupler. Measurements were made between 


3-0Gc/s and 3-75Gc/s, using standard WG10 waveguide 
(2:84in x 1-34in internal dimensions). Both the primary and 
the secondary guides were machined out of the solid. The 
common wall was a sheet of copper, 0:030in thick, clamped 
between the primary and secondary guides. This method of 
construction allows the rapid interchange of coupling apertures, 
and permits the use of a very thin common wall without the 
danger of distorting the guides. The elliptical apertures were 
cut by hand; dimensional errors did not exceed 3% of the 
major axis. A difficulty is to ensure that the transition to normal 
waveguide at the ends of the copper sheet does not produce 
excessive reflections. Preliminary calculations indicated that, if 
the inevitable gap could be reduced to less than 0:010in, the 
reflections would not be significant. Measurements on the 
primary guide, using a copper sheet without any coupling 
apertures, showed that the voltage standing-wave ratio (v.s.w.r.) 
never exceeded 1-il. Since it is only the reflection at the far 
end of the coupler which can affect the measurement of directivity 
it may be concluded that an error due to this cause would not 
be significant for directivities less than S5O0dB. A somewhat 
greater difficulty was the need to accommodate a load with a 
sufficiently low v.s.w.r. in the restricted length available in the 
secondary guide. Using four staggered resistance-card attenua- 
tors, the v.s.w.r. could be kept below 1-1 over the complete 
experimental bandwidth. 

Fig. 6 shows a block diagram of the measurement system. The 
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Fig. 6.—Block diagram of the experimental apparatus. 


directivity was determined by a substitution method, using a 
calibrated variable attenuator. The change in attenuation in the 
primary guide, required to keep the signal level detected in the 
secondary guide constant on reversing the coupler, was measured. 
The klystron was square-wave modulated at 3kc/s, and the 
rectified signal from the crystal detector in the secondary guide 
was amplified by a selective amplifier (bandwidth 30c/s) and 
displayed on a cathode-ray oscilloscope. After eliminating 
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various sources of microwave and audio-frequency leakage, the 
detector sensitivity was limited by noise at a power level of 
about 70dB below that fed into the primary guide. Coupling 
factors of the order of 30-40 dB were used in the experiments, 
so that the maximum measurable directivity, as limited by 
signal/noise ratio, was not less than 30 dB. 


(3.2) Experimental Results 


The directivity was measured as a function of wavelength for 
each aperture. A typical experimental curve is shown in Fig. 7. 


36 —_—_—_- 


32 


is] 
a 


24} 


aS 


DIRECTIVITY, dB 


ial 
8O 85 oO 95 Le} e) 
Ao,cm 


Fig. 7.—Experimental curve of directivity as a function of wavelength 
for an aperture with d/a = 0-5. 
e = 0-730. 2p = 1-66cm. 
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Fig. 8.—Measured wavelength for maximum directivity compared 
with the theoretical curve. 
@ 2 =1-13cm. 


In this, as in all other cases, the maximum directivity was too 
great to be measured, but the wavelength for maximum directivity 
could be deduced with an uncertainty less than +4°%%. 

Fig. 8 shows the results obtained for a series of apertures with 
constant minor axis, and for a series with constant major axis, 
compared with the theoretical curve. As may be seen in Table 2, 
the coupling factor does not vary over very wide limits for the 
apertures in one series. A single experiment was performed in 
which the wavelength for maximum directivity was measured 
for an aperture off the centre-line, and subsequently for an 
identical aperture on the centre-line. The result is shown in 
Fig. 9. Table 2 lists the geometrical parameters of the apertures 
investigated, and compares the experimental results with the 
theoretical results for the frequency at which the directivity has 
its maximum value. 


(4) DISCUSSION 


Comparison with theory shows a general shift of the experi- 
mental points to higher wavelengths. This shift is attributed to 
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Fig. 9.—Measured Ao» for two positions of similar holes compared 
with the theoretical curves. 
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the finite size of the holes, and indicates that the effect of the 
variation of the magnitude and the phase of the fields over the 
aperture, neglected in Bethe’s theory, is in fact not insignificant. 
The unperturbed field has a phase variation proportional to the 
length of the aperture, and a magnitude variation which is a 
function of the width. Although it is of course the variation of 
the perturbed fields which is relevant, one can form some 
tentative impression of the relative importance of the two effects 
from Fig. 8. Since the experimental results do not run parallel 
to the theoretical curve with either p or g constant, it would 
appear that, for the range of apertures used, both effects are 
significant and lead to deviations of the same order of magnitude. 
A qualitative explanation for the discrepancies revealed is not 
hard to find. For an aperture on the centre-line, the coupling 
produced by the z-component of the magnetic field is entirely 
neglected by thé theory since its mean value over the aperture 
is zero. Yet for apertures of the dimensions used, the value of 
H, at the edge of the aperture may amount to as much as 2594 
of the value of H,. Since the coupling is proportional to the 
square of the amplitudes of the normal mode components, the 
effect of H, will, for the contradirectional coupler, be such as to 
reinforce that due to H,. Inspection of eqn. (8) will show that 
this would indeed result in a shift of the wavelength for maximum 
directivity to longer wavelengths. One might expect that the 
order of magnitude of this effect would be predicted by using the 
r.m.s. value of H, over the aperture in eqn. (8). The eighth 
column in Table 2 has been calculated in this way. It is seen 
that the correction is somewhat less than, but of the same order 
of magnitude as, the discrepancy. A more accurate estimate of 
the effect of the H,-component would require an extension of 
Bethe’s theory to fields which are not constant over the aperture. 
A further departure from the theoretical assumptions is the 
phase change along the length of the aperture, where this is an 
appreciable fraction of the guide wavelength. The sense of the 
error incurred can again be deduced by a qualitative argument: 
the field radiated into the secondary guide by various sub-areas 
of the aperture will have different phases. This will result in a 
directional effect of the interference type, in the opposite sense 
to that produced by the intrinsic action of the elliptical aperture. 
Thus the backward wave is weakened, the forward wave remain- 
ing substantially unchanged. This effect clearly could not alter 
the frequency at which the directivity reaches its maximum value, 
in the idealized case where this maximum is infinitely large. In 
practice, the presence of losses ensure a finite directivity and the 
reduction of the amplitude of the backward wave will result in a 
further loss of directivity, an effect that will increase rapidly with 
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frequency. In consequence one observes a reduction in the fre- 
quency at which the maximum directivity is obtained. 

The coupling factors were not measured accurately, but were 
always 5-10dB higher than the theoretical values listed in 
Table 2. This again is undoubtedly due to the limitations of 
the theory for large apertures, as is the large discrepancy between 
predicted and measured bandwidths. These discrepancies are 
also encountered in the design of conventional Bethe couplers. 


(5) CONCLUSIONS 


It has been shown that, by suitable choice of the position and 
eccentricity of an elliptical coupling aperture in the common 
broad wall of two parallel rectangular waveguides, an intrinsically 
directive coupling can be obtained. This coupling may be either 
codirectional or contradirectional. The frequency at which the 
directivity has its maximum value is restricted only by the cut-off 
frequency of the guides. Within certain rather wide limits the 
bandwidth may be specified separately. 

Experiments with contradirectional elliptical apertures has 
shown that the frequency of maximum directivity, for apertures 
with coupling factors of the order of 30dB, can in most cases 
be predicted from theory with an accuracy of about 5%. The 


frequency at which the directivity has its maximum value as . 


predicted by theory always exceeds the measured values, a dis- 
crepancy which is readily explicable in sign, and order of mag- 
nitude, by the variation of the field quantities over the aperture, 
which is neglected by the theory. 

Considerable discrepancies are noted between measurements 
of the coupling factor and bandwidth, and predicted values based 
on the small-aperture theory, and are attributed to the inaccuracy 
of the latter for the comparatively large apertures used in the 
experiments. 
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An example is given of a two-aperture coupler, using elliptical 
apertures, which is contradirectional at one frequency and 
codirectional at another, and its performance is calculated. 
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SUMMARY 


The paper examines the detection of a signal received upon a noise 
background, and determines the transfer functions of the networks 
which maximize (a) the ratio between the average amplitude of n 
successive samples of the output signal and the r.m.s. output noise, 
and (6) a continuous sample of the output. The optimum ratio is 
calculated when the input is a rectangular pulse upon white noise or a 
cosine pulse upon non-white noise. 


(1) INTRODUCTION 


The detection of a signal in the presence of noise is achieved 
by observing some quantity whose value in the absence of the 
signal is measurably different from that when the signal is present. 
In many instances it is convenient to choose as the observed 
quantity some function of the amplitude of the signal plus noise 
at the output of the detector. In the absence of the signal the 
observed quantity fluctuates with the noise background. Thus, 
for a weak signal the detection efficiency depends upon the 
amount of change in the observed quantity which constitutes, 
with reasonable probability, an indication of the presence of the 
signal. The inclusion of a further network to increase the ratio 
between the observed quantity when the signal is present and its 
value when the signal is absent may facilitate detection; the 
consideration of such a network is the purpose of the paper. 

Dwork! obtained an expression for the transfer function 
required to produce at the output the greatest possible value of 
instantaneous peak signal amplitude consistent with a fixed mean 
output noise power. The resulting ratio between the peak 
signal and the r.m.s. noise is 


Pea ite alone 
P2, ln eae. ° . ° . (1) 


where |F(w)|? and |o(w)|? are the power spectra of the signal 
and the noise respectively. The transfer function which leads to 
the above value of P./Py is 


F*(w)e—JoF 


“Alo(w)|2 . . . . ° (2) 


where A is an arbitrary constant, T is the time of occurrence of 
the peak, and the asterisk denotes the complex conjugate. 

Dwork’s transfer function is not always realizable by a physical 
network. 

With white noise the condition for physical realizability of 
Dwork’s optimum transfer function is that the entire input signal 
must have entered the circuit prior to the sampling time at which 
the peak signal is to be produced. This condition is also obvious 
intuitively, since it is equivalent to the statement that the process- 
ing circuit in the receiver output cannot make the best use of the 
signal until the entire signal has been received. It follows that, 


H(w) = 
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for a signal of finite duration received upon white noise, the 
realizability condition may always be satisfied by the choice of a 
sufficiently large sampling time. 

When the signal is not of finite duration the realizability con- 
dition implies that the sampling time must be chosen at infinity. 
If the signal amplitude tends to zero with increasing time, a 
circuit designed to process a sufficiently large time sample of the 
signal will lead to a value of Ps/Py sufficiently close to the 
optimum value given by eqn. (1). In practice, the necessity of 
evaluating the receiver output at a finite time implies that the 
processing circuit must be designed to deal with a finite sample 
of signal and hence with an input whose amplitude may be 
supposed to be zero after the lapse of a sufficiently long time. 

Zadeh and Ragazzini* have shown that a physically realizable 
transfer function to produce a maximum ratio between the peak 
signal and the r.m.s. noise at a chosen time T is given by 


1 oul Sark e(o, ae 
Hi ee ee jot NI pjoit—T) 
i Pat dt oxesonae does) 
The function o;(w) must be chosen so that |o,(w)| = |o(w)| 


and o;(w) has no poles or zeros when %w > 0. 

Since eqn. (3) is more cumbersome to evaluate explicitly than 
eqn. (2), it is desirable to make direct use of eqn. (2), and hence 
eqn. (1), whenever possible. It is accordingly desirable to know 
the least restrictions that must be placed upon the signal and the 
noise in order to permit physical realizability of the transfer 
function (2). However, a discussion of the condition for 
physical realizability of Dwork’s transfer function when the 
noise is not white is not at present available in the literature. 

The theories of Dwork and of Zadeh and Ragazzini concern 
the production of a single signal peak that is instantaneously as 
large as possible in comparison with the r.m.s. noise. In many 
instances a more efficient method of detection results if the output 
is sampled over a continuous interval. It may then be desirable 
to include a network whose function is to produce an extended 
signal peak whose value averaged over a given finite time is as 
large as possible in comparison with the r.m.s. noise. 

An alternative method of detection is in terms of the average 
value of n successive samples of the output chosen at equal 
intervals. The theory in the paper is illustrated by the deter- 
mination of a transfer function to maximize the ratio between 
the average of n successive samples of the output signal and the 
r.m.s. Output noise. As the number of samples is increased and 
the time interval between them is decreased, the average of the 
discrete samples approximates to that of a continuous sample. 
The problem of deciding whether or not a given sequence of 


samples indicates the presence of a signal has been discussed by 
Slattery.? 


(2) OPTIMUM TRANSFER FUNCTION 


Adapting the notation of Dwork,! the signal input, V(t), and 


output, V(t), of the network at time tf may be represented in 
the forms 


Vi) =(lJ2m)| Fedo . . 6. 
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Vo(t) = (fon) | Fe)Hwedoe . 2 . ) 


where Foy { Viedr . . . . . 


and H(w) is the transfer function of a network chosen to process 
the pulse in a manner to facilitate detection. If the power 


spectrum of the input noise is |o(w)|?, the mean square noise 
output is given by 


P2, = (12m) | lotyst(e|2deo . ea ka) 


Suppose that the transfer function H(w) is to be chosen such 
that, for some specified function G(w), the quantity 


PAT) (jon) J F(w)H(w)Gw)eTdw . . (8) 
is as large as possible compared with P*. Then H must be 
chosen such that 3Q/)H = 0, where 

2 = (1/277) [F(w)H()Gw)e/T — Alo(w)H()|?] . ©) 


and A is a Lagrangian multiplier.4 The condition )0/)H = 0 is 
fulfilled when 


F()G(w)e/T — dlo(w)|2H*(~) =0 . . (10) 
| __ F*(w)G*(w) eT 
Thus H(w) = Noto? (11) 


leads to a stationary value of Q. 
_ When A(w) is chosen according to eqn. (11), then P;(T) of 
eqn. (8) has a value, Ps, given by 


|F@)|?|G@)|? 


Ps = (1/2n)| Nowaye 22s 2) 


and so the value of Ps(T)*/P% is 


F)716@)?* 1, 
|o(w)|? 
It will now be shown that the stationary value of P%/P% 


according to eqn. (13) is indeed the maximum value of Ps(T)?/P%. 
It follows from eqn. (8) that 


P2(P2, = (12m) | (13) 


|P<(T)| < U/2) [FH @)G@)|de . - (4) 


[f |F(@)H(w)G(w)|dew ? 
J |o(@)H(@)|7dw 


Hence |P5(T)/Py|? < (1/27) (15) 


Now by Schwartz’s inequality* for any functions x(w) and y(w), 


[f x(w)y(w)dw 
f | x(w)|*dw 


Putting x(w) = o(w)H(w) and y(w) = F(w)G(w)/o(w), eqns. (15) 
and (16) show that 


<flyw)idw . . . 6 


© |F(@)|?|G(@)|7d 
lPDIPyP< apm | Pecos an 


and hence P2/P% as given by eqn. (13) is indeed the maximum 
value of P.(T)?/Px. 

In order that the transfer function H(w) of eqn. (11) should 
correspond to a physically realizable network it is necessary that 
the output response, Vo(t), to a unit impulse input, V,(0), at 


time t = 0 should be zero for t< 0. Now, with H(w) chosen 
as in eqn. (11), the output resulting from a unit impulse input is 


© F*(w)G*(w)esoe-T) 


V(t) = (1/277) tek A|o(w)|? 


dw . (18) 


Substitution of F(w) from eqn. (6) enables eqn. (18) to be written 
in the form 


Volt) = | eMeyhr — ¢ — 1dr, det = ado} 
where Ore It Teepe ee ee 
and g(t) = (2m) | Gwyelotde eel 


The condition that Vo(¢) = 0 for t < 0 is thus equivalent to the 
condition 


J econ —t)dty=0 for *>T . . . @2) 


The function g(t) represents the response P(t) to an impulsive 
input in the absence of the processing network. The left-hand 
side of eqn. (22) is the mutual correlation function of g(t) and 
the Fourier transform of F(w)/A|o(w)|?.. The condition for 
physical realizability of the transfer function (11) is thus that the 
mutual correlation function is zero for t > T. 

In many instances the specified function G(w) is the transfer 
function of a filter of finite memory, so that g(t) is zero for 
sufficiently large values of ¢. Suppose, in fact, that there exists 
a ty such that g(t) = 0 for t > fp. The condition (22) is then 
satisfied by any function h(t) for which A(t) = 0 when t > T — fo, 
and hence is satisfied whenever 


© F ; 
| aoe = 0 for st — eS) 


By a theorem of Wiener? the condition for the existence of a 
finite time T for which eqn. (23) may be satisfied is equivalent to 
the condition 


fe f tog | ON Jaen < 00 - . (A) 


—~ol +a? |o(w)|? 


The form (24) of the realizability condition is a precise formula- 
tion which includes the intuitively obvious condition that there 
must be no frequency band over which the noise vanishes but 
the signal does not. The existence of such a frequency band 
would imply that P;/P,, could be made infinite by the use of an 
ideal rectangular band-pass filter. The problem discussed in 
the paper would then be replaced by that of approximating the 
ideal rectangular filter by a realizable one with a finite number 
of circuit elements. P,;/P,, could be made as large as required 
by the choice of a sufficiently close approximation. 

To illustrate the above remarks, consider the case of an input 
rectangular pulse of duration ¢, received upon a noise background 
for which |o(w)|? = w?/(w* + w?) where w, is a constant. 
With the time origin chosen at the centre of the pulse, 
F(w) = 2A (sin 4wt,)/w, where A is the height of the pulse. 
The functions |o(w)|* and F(w) are shown in Fig. 1. The 
condition (24) is not fulfilled, because of the infinite behaviour 
of the integrand at w = 0. However, Ps/Py may be made as 
large as desired by the use of a sufficiently narrow low-pass filter. 

Likewise, if a rectangular pulse is received upon a noise back- 
ground for which |o(w)|? = w?/(w? + w3), eqn. (24) is not true 
because of the behaviour of the integrand at w = ©, In this 
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Fig. 1.—Graph of F(w) = 2A (sin Jota)/o and |o(w)|2 = w?/(2 + «)- 


instance P;/Py may be made as large as required by the use of a 
sufficiently high-pass filter. However, if a cosine pulse of shape 
V(t) = cos (rt/t,) of duration —4t, << t < 4f, is received upon 
a noise background for which |o(w)|* = w?/(w* + 3), then 


_ QaIt,) cos tut, 
Git? — w 


and the condition (24) is fulfilled. The graphs of |o(w)|? and 
F(w) are shown in Fig. 2. 


F(w) 


wt /2 OR wip 


Fig. 2.—Graph of F(w) = (27%tg)[(7]ta)2 — w2]-1 cos $wtg and 
|o(co)|2 = ef /(w2 + 5). 


If G(w) is chosen as 


G@) = 1 + e Jor 4 eHow 4 | 4 ge NJor 
eT ies (25) 
sin 4wT i 


then P;(7) represents the sum of ” samples of the output chosen 
successively at intervals 7. Also 


g) = VO + Vie —7) + Vit —27)+...4 Vi[t —G@— Dr] 

and eqn. (22) takes the form 

A(t) + h(t —7) + h(t — 27) +... + Alt —@ — 17] = 0 
Oe PSS IP (26) 

which implies that T must be chosen such that h(t) = 0 for 
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t> T — (n — I)r. 
fixed values of ty 


Putting m7 = ft, and letting t —> 0, then for 


TG(w) — —tjota (27) 


2 sin totg, 
WwW 


and tP,(T) represents the output integrated over the time fy. 
21 : 
For white noise |o(w)|? = 1, A) = * V;(t), and the condi- 


tion (26) implies that the entire signal must have entered the 
network before the first sample is chosen. 

If the noise is not white and it is impossible to choose a finite 
T to satisfy eqn. (22), the method of Zadeh and Ragazzini? may 
be applied to show that the optimum realizable transfer function 
for a given T is 

O R* * 
HGS) = “iota | FevGiey (ov 


ane me 
1 1 


1 foe} 
277A (w) ‘ 
where o,(w) is chosen as in eqn. (3). It is to be noted that the 
transfer function (28) is, in general, optimum only for a fixed 
value of 7, while the function (11) is optimum over all values of T. 


= 00 


(3) SAMPLES OF RECTANGULAR PULSE UPON WHITE 
NOISE 


If the input signal consists of a rectangular pulse of duration 
t, then, with suitable choice of the origin of time, 


A for |t|< 44, 
ys 0 for |t| > 4, 2) 
Therefore Fw) = 2A (sinjwt,)/w . (30) 


and the total energy contained in the pulse is w = A?t,. 

To determine the network function which maximizes the sum 
of n samples of the output at intervals 7, put 
Gu) = SAMO sya 

sin 4wr 

The quantity P;(7)7 represents an approximation to the time 
integral of the output amplitude over a time nr. The similar 
integral of the r.m.s. noise is n7Py, and so P,(T) should be com- 
pared with nPy. For a fixed n the maximum value of 
P;(T)|nPy is Ps/nPy, where, according to eqn. (13), 


GB) 


A’ ¢ © sin? Let, sin? dnwr 
Mi) Pfr Poa a 
nes BSED 2m? J_ (4wt,)* sin? tr ie G2 
_ 2w f° sin? x sin? (nrx/t,) 
5 ah sin Gai 


where x denotes twf,. In eqn. (32) it is assumed that 
|o(w)|? = 1; thus w denotes the ratio between the input signal 


energy and input mean square noise amplitude per unit 
bandwidth. 


Evaluation of the integral (33) leads to the following formula 
for M,(7) (see Section 6.1): 


M,{r) = (w[n?)[3 + 4(n — 2) + 3) 


— C/6)n@@ — 1) + 1)(7]tQ) 
n—1 
ale Py kll—@— k)(7]tg)|] (34) 


which is plotted as a function of 7 in Fig. 3 for several values of n. 
The maximum signal/noise ratio that can result after integra- 
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Fig. 3.—Optimum ratio between the average of m successive samples 
of the output signal amplitude and the r.m.s. noise amplitude when 
the input is a rectangular pulse upon white noise. 


tion of the output over a time tz is obtained by use of eqn. (27). 
The resulting ratio is 


PR 1248 | © sin? x sin? (xtg/t,) 
a = e = <a) 35 
M(a) ear TA? ah aan xA vf > 
which has the values (see Section 6.2) 
w( = oe) if ty <i te 
30%) 
= (39) 


te el BENS ie 
—— (= fare t 
wand vl AGS We Lip ls 
The function M(t,) is shown by a broken curve in Fig. 4; 
the continuous curves show M,(7) = M,,(ta/t,) for n = 2-5 and 
are included for comparison. Examination of Fig. 4 shows 


1-0W 


tg/tg 


Fig. 4.—Optimum ratio between the average output signal amplitude 
and the r.m.s. noise amplitude when the input is a rectangular 
‘pulse upon white noise. 


‘that the average of a series of discrete samples of the output is 
approximately equivalent to a continuous average, provided 
that the time interval between successive samples is less than the 
length of the input pulse. 
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(4) CONTINUOUS SAMPLE OF COSINE PULSE UPON 
NON-WHITE NOISE 


If the input signal is a finite pulse of cosine shape given by 


V(t) = Acos (at/t,) for |t| <4, 
0 for |t| > 41, or dal) 
EY 
aon Fy 277A costwt, (38) 


tq (a/t,)* a 
and the total energy contained in the pulse is w = 4.471,. 

The cosine pulse may be regarded as approximating to the 
shape of an originally rectangular pulse which has been trans- 
mitted through a propagating medium and has suffered distortion 
in the form of a rounding of the discontinuities at the beginning 
and the end. 

Suppose that the pulse is received upon a noise background 
for which |o(w)|? = w3/(w% + w*), where w, is the value of w 
at which the mean square noise per unit bandwidth of w is equal 
to one-half of the d.c. component. 

By eqns. (27) and (13) the maximum signal/noise ratio for a 
continuous sample of the output integrated over a time fy is 


P2 8A” cost4twt,  sin?dwty we + w 
Ma) apa = ae NS ifs Gal) eco aa 
(39) 
Thus (see Section 6.3) 
_ 4wt, (we + 7/22) 
M(tg) = 2 Ab 208 COS (atq/ tg) 
t, (we + 7/02). t, Bw? + 72/12). 
Z 2 sin? (artg/2t,) : Trae sin (7t4/t,) 
if ty < t,, and 
4w ft t t, (wr + wt 
= 9) ‘a a ( b 2) 
M(a) a | ty + 2ty w? Se) 
iftz= 2) 


The function M(t,) is plotted as a function of tg for various 
values of w, in Fig. 5, the appropriate value of w,/(/t,) being 


M(tq) 


Fig. 5.—Optimum ratio between the average output signal amplitude 
and the r.m.s. noise amplitude when the input is a cosine pulse 
upon non-white noise. 


indicated on each curve. The curve for which w,/(z/t,) = © 
applies when the input noise is white. The value of W indicates 
the ratio between the input signal energy and the input d.c. noise 
power. This definition of W is consistent with that adopted 
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in Figs. 3 and 4. It may be noted that for Figs. 3 and 4 the 
total input noise power is infinite, while for Fig. 5 the total input 
noise power is equal to 4w,. 

Comparison of Figs. 4 and 5 indicates that, when the con- 
tinuous sample of the output extends over a time less than the 
duration of the input pulse, it is possible to design a much more 
efficient optimum network to deal with a cosine pulse than with 
arectangular pulse. For a cosine pulse the optimum signal/noise 
ratio is approximately the same as for white noise when 
w,|(7/t,) > 2 but is significantly higher for smaller values. 
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(6) APPENDICES 
(6.1) Proof of Eqn. (34) 


The integral in eqn. (33) may be evaluated by the following 
procedure. First divide sin (7x//,) into sin (n7x/t,) by means 
of formulae (403.11) or (403.13) of Reference 6. Then express 
the integrand of eqn. (33) as a sum of terms of type (sin? kx)/x? 
and integrate according to the formula 


© sin? kx 
I (41) 


Loe dx = |k|n/2 


Proof of the general formula (34) is very tedious and is outlined 
below for the specific instance in which n = 3. Since 
Sina Ta) ae 3 
sin? (7x/t,) [ 


4 sin? (rx/t,) |? 
then 


sin? x sin? (37x/t,) _ 3 sin x — 4 sin x sin? (rx/t,)? 
x2 sin? (rx/t,) x 


_ [sin x + sin (1 + 27/1)x + sin(l — 27/t.)x |? 

x 
me Sine 4 sine (7/h x 
2 eee 


_ sin? (1 + 27/t,)x 
T x2 


7) 
4 2 sin? (1 a T|t,)x ve sin? (1 — 27/t,)x 
Be x 


ae sin? (1 —z/t,)x _ 2 sin? (2r/t,)x 
x2 x2 


(42) 
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Thus, by eqn. (41), 


© sin? x sin? (37x/t,) ‘ ( Ar | 2r | T ) 
- Vdx =407(6 —— 4+ |1 ——}/4+2])1-—- 
J x2 sin? (Tx/t,) Ce ie ty ta| 
and hence | | 
w aT ae 27| Te ) 
= a 1——| +2)1 —— 43 
wa -Ho-H)+)-E ht) 
in agreement with eqn. (34). 
(6.2) Preof of Eqn. (36) 
The integral in eqn. (35) is 
(20) 8%) +2 
Tee i _ sin (tal fe) oe (44) 
Se x 


0) 


By the convolution theorem for the Fourier cosine transform’ 
eqn. (44) may also be written in the form 


I =| u(y)v(y)dy (45) ) 
0 
© sin? x | 
where u(y) = (2/n)1? | 5— cos xydx 
On oY | 
oO -:; 2 
and wy) = (2/m)12 | sin” (tate) cos xydx 
a x 
The explicit values of u(y) and u(y) are respectively® 
mule 
—~) (—4y) ify<2 
uy) = Gas (46) 
0 ify>2 
and 
(@) "te [t.) —4y] if y<2tg/t 
u(y) ie D) dla PA dl*a (47) 


| 0 if. y= 2h yft, 


If expressions (46) and (47) are substituted into eqn. (45) the 
integration is trivial and leads to 


an 14 d 
> a(t — x4) iN Ty Toe 
iad Tt 1 Oo) 
ae (fe eres 
pas tiie 


(6.3) Proof of Eqn. (40) 

For real values of w the integrand of eqn. (39) is the real part 
of the function 

f(w)(@}, + w?) 


{ co? = \(Gaifi,)*-o*wp ee 


42(w) = 


(49) 


4 fe 


tg ta 


Fig. 6.—Path of integration of z(w). 
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Where Boor t4 (w? + 7/22) ) 
f(w) =1 + eltae — gitao — eiltattido Lett (50) as on [tq — (tg — tg) COS (1ty/ tq) 
| = 2] 2 
‘In the above definition of f(w) the plus sign is to be used if -}| +] =< 2Az Gels fait) sin (7t,Jt,)| if tg < t, p52) 
tg < t, and the minus sign if ty > ¢,. 7 (w+ m2 
For complex values of w the function z(w) has no singularities —t2 (w? + w/t) - 
inside the contour shown in Fig. 6. The integral of z(w) around A E L273 w? i iene 


J 


the large semicircle tends to zero as the radius of the semicircle 


tends to infinity. The integrals around the small semicircles Integration of z(w) around the contour in Fig. 6 shows that 


| tend to the following values as the radii tend to zero: ca De eee 
a , cos*4wt, sin? 4wtg w3+w 
=i) : dw = — 53 
as g IE fig < Up, | [ose Grit) |. oe w? oy) 
= WER aieae(55) he sins 
Clie Substitution of integrals (51) and (52) into eqn. (53) leads to the 
4 if tz > t,4 
Cc 73 j result (40). 
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sin 6 : ; 
where sinc 6 = Be = Tht; Y = Tftz; which obviously 


DISCUSSION ON THE 


Mr. E. L. R. Webb (Canada: communicated): During the 
course of correspondence with Prof. Heaps it became apparent 


that there was a subtle but definite point of difference in the reduces to the same form as would eqn. (13) for ty = 


presentation of radar and sonar signals on type ‘A’ scans. 
Whereas in radar, the observer’s reaction bandwidth is almost 
invariably (Moon radars excepted) the narrowest in the system, 
29 sonar it may occur that the observer is relatively wideband. 


Similar (and even more unwieldy) expressions result for other 
pulse shapes and non-white noise. It is a general result, how- 
ever, that for vanishingly small values of f, (instantaneous 
sampling) all signal/noise ratios remain finite. 


in other words, the radar observer integrates over a number of 
| sweeps, whereas a sonar observer may make several ‘partial’ 
_fadgments during the course of a single sweep. The result is 
_a tendency for a radar man to use a different definition of Prof. 

Heap and Mr. McKay’s function G(w). Specifically, if G(w) 
"were instrumented by a network which behaved like a ‘tapped 

delay line adder’ the signal would still be given by eqn. (12). 


Prof. H. S. Heaps and Mr. M. R. McKay (in reply): We are 
grateful for Mr. Webb’s remarks concerning the differences in 
the presentation of radar and sonar signals. 

Throughout the paper it is supposed that the signal and noise 
are passed through a single filter of transfer function H(w) and 
that G(w)/n represents the process of averaging n samples of the 
output signal amplitude. The average of the 1 samples is com- 
pared to the r.m.s. output noise, and the resulting ratio is found 
to have a maximum value given by Ps/nPy. 

In Mr. Webb’s communication it is supposed that the signal 
and noise are passed through a filter followed by a tapped delay 
line adder of transfer function G(w). The quantity P/N, 
represents the ratio of the signal output of the adder at a single 
@ instant to the r.m.s. noise output of the adder. It should be 

f | F(w)|?| G(w)|4 is noted that since Mr. Webb assumes H(w) to be chosen according 
N. iow)? 2m to eqn. (11), then H(w) maximizes the ratio of the signal output 

=F of the adder to the r.m.s. noise input of the adder. The maximum 
value of Ps/N,,; occurs when dQ’/)H = 0, where 


|F(w)|?|G(w)|? dw 
A|o(w)|? 2a 


Te 


but the noise output would become 


and eqn. (13) would be replaced by 


Q’ = (1/27) [F(@) H(a)G(a)elT — dl o(w)|7| H()|?|G)|?] 


|FP?|G|? dw 
Srna fice ( tio) (J |o|? Qn The condition }Q’/0H = 0 implies that 
ignal/noise (power ratio) = SS 
|FLAGT dw F*(w)G*(w)eJoT 
|o|? 20 H(w) = 


A| a(w)|?| Ga) |? 


For the case of rectangular pulse, white noise and integration 


; 4 Nuut)” is given b 
over time ft, this expression takes the form out) 1S & y 


and hence the maximum value of (Ps/ 


Soh de 


wp (f sinc? x sinc? yd f)* 
f sine? x sinc* ydf a 


= On) | i 
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SUMMARY 


A general expression is obtained for the maximum rate of com- 
munication of information in pulse-code modulation systems using 
any scale of numbers, when the interfering noise has a Gaussian 
amplitude distribution. The particular cases of binary and tertiary 
coding are dealt with in more detail, and a comparison is made between 
the communication rate R and the ideal channel capacity C. It is 
shown that for both systems the communication efficiency R/C reaches 
a maximum of the order of 80% at particular signal/noise ratios. 
It is also shown that in all systems, regardless of the scale of numbers 
used, the communication efficiency tends to 63-6% as the signal/noise 
ratio tends to zero. 


(1) INTRODUCTION 


Any communication system in which a signal is sampled 
regularly and the sample amplitudes are quantized and trans- 
mitted as numbers is a pulse-code modulation (p.c.m.) system. 
The effect of noise is to introduce errors when decoding at the 
receiver, and previous assessments of the performance of p.c.m. 
systems! have been based on the signal/noise ratio required to 
keep errors down to a specified minimum. Whilst this is a 
good criterion to use with the simple practical systems at present 
in use, it is not a true measure of the potential capacity of systems 
for transmitting information if an error-correcting code is used. 
Without in any way suggesting how this error correction is to 
be achieved, the paper is concerned with the calculation of the 
true maximum rate of communication of information in p.c.m. 
systems, from a knowledge of the a priori and a posteriori 
uncergainties of the transmitted symbols or numbers. 


(2) CODING AND DECODING 


To simplify the exposition, all voltages in p.c.m. systems, 
both transmitted and received, will be referred to unit resistance. 
Also, the interfering noise will be assumed to have a Gaussian 
amplitude distribution, in which case, when the noise has power 
o”, the amplitude distribution is 


1 — yy 
BO) ov/ (277) oe ( 202 ) 


In the general p.c.m. system a continuous message of band- 
width Afis sampled at a frequency 2Afand the amplitude of the 
samples is quantized into Q levels. If coding is to be in scale 
of n numbers, then Q is chosen such that 


SY) (2) 


where X is an integer. It is then possible to specify the message 
by a succession of scale-of-n numbers transmitted at a frequency 
2XAf. In all that follows a simple p.c.m. system in which 
X = 1 will be assumed. This restriction is imposed solely to 
provide a simplicity of description, and the arguments can very 


(1) 


Correspondence on Monographs is invited for consideration with a view to 
publication. 


Dr. Billings is at the University of Bristol. 


easily be extended to systems where X¥ ~ 1. Thus the following 
arguments are only strictly true for what are termed one-to-one 
coding systems but can easily be modified for other systems. 
These simple p.c.m. systems operate as follows. A con- 
tinuous message of bandwidth Af is sampled at a frequency 
2Af and quantized into n levels, and pulses of short duration 
with respect to 1/Af are transmitted to identify the quantized 
level. The coding system is such that when the rth level in the 
series 0, 1,2...r...(n — 1) is selected a pulse of amplitude 
(2r —n+1)K is transmitted, where K is a constant. It is 
convenient to introduce another parameter k which relates K 
to the interfering noise. If o is the r.m.s. noise voltage, k is 
defined as K/o. Thus the transmitted signal consists of narrow 
pulses spaced 1/2Af seconds apart, having amplitudes selected 
from a group of amplitudes spaced 2ko apart and ranging from 
(n — 1)ko to (1 —n)ko. The pulses thus produced are passed 
through the transmission path which, for the present, will be 


assumed to have a rectangular pass characteristic with cut-off . 


frequency Af. The output from this transmission system no 
longer consists of pulses, but is a smooth time function with 
random noise superimposed on it. This signal + noise is 


sampled at the receiver at times corresponding to the transmitted - 
pulses, and an assessment is made of the probable sequence of ~ 
transmitted numbers, using the decoding method described below. ~ 


Again, in the interests of simplicity, the overall transmission 
system, including amplifiers, will be specified as such that in the 
absence of noise the amplitudes of transmitted and received 
samples shall be equal. 

The decoding technique is the simplest possible. The ampli- 
tudes of the received samples, including noise, are compared 
with the quantizing levels at the transmitter, and any particular 
received sample is decoded as specifying the number associated 
with the quantizing level to which its amplitude is nearest. 
Owing to the presence of noise, decoding errors will occur and 
there will always be some residual uncertainty about what was 
transmitted. It is this residual uncertainty which must be 
evaluated. 


eS eee. eee eee eel 


Having decoded the received samples, a quantized version of © 
the original continuous message can be regenerated, either by 


passing the pulses through a perfect low-pass filter of bandwidth 


Af or by using a step detector and appropriate equalizer. In — 


this context a step detector is defined as a device which, when 


supplied with a sequence of pulses of amplitudes 4,, A), A3, etc.,.%, 


will produce an output of amplitude A, between the arrivals of 
the first and second pulses and an output of amplitude Ay 
between the arrivals of the second and third pulses, and so on. 


(3) CHANNEL CAPACITY 
Shannon? has derived an expression for the capacity of a 


channel for transmitting information in the presence of Gaussian — 


noise. If the noise power is P,,, the mean signal power is P, and 
the transmission bandwidth is Af, the channel capacity is 


C= Aflogs (ee yee ee (3) 
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In any practical system subjected to restrictions on mean power, 
' bandwidth and noise, the rate of transmission of information, R, 
is always less than C, the rate at which information is transmitted 
by an ideal system subjected to the same restrictions. A con- 
venient measure of the performance of a practical system is 
_ given by the ratio R/C, which is here termed the communication 
| efficiency. In computing C it is necessary to calculate P,, which 
will involve the parameters of the practical system. This does 
not imply that C is dependent upon the coding system but rather 
that P, is so dependent. For the simple system described above 
the mean signal power is easily calculated in terms of received 
noise power o%, for if all the m numbers have equal probabilities 
of selection 


n—1 
P, = (ifn) & k202(2r — n + 1)? (4) 
r=0 
ph ee 2 
whence P,[P, = et (5) 


and the channel capacity of the ideal system subjected to the 
same restrictions is 


C= Af flog, [3 + (n® — 1)k?] — log, 3} (6) 


(4) RATE OF TRANSMISSION OF INFORMATION 


If x;, X2..., etc., are the transmitted numbers and y,, y2..., 
etc., are the decoded versions of these numbers, it is possible to 
specify the uncertainty of the transmitted sequence as A(x), 
that of the received sequence as H(y), and the uncertainty of 
the sequence x11, X2/2..., etc., as H(x, y). It can be shown? 4 
that the rate of transmission of information is 


R= H(x) + HO) (7) 


and therefore the problem is simply that of determining H (x), 
H(y) and H (x, y), the coding and decoding techniques and the 
properties of the interfering noise being known. 


— H(x, y) bits/sec . 


Go = - 8a a0 (4) foe [tee (5) ]+[ 3 +ert (4) Jive 
en fe[2s]-ac5] m (o 55] D) 
wT oes - on™) eee eee eG) 


m 


n—1 
+ > {! 


(4.1) The Uncertainty H(x) 


It is well known? that the uncertainty of numbers selected 
independently at a frequency 2Af from n possible numbers is 


n—1 
H(x) = — 2Af ae p(i) log, p(i) bits/sec (8) 


where p(i) is the probability of occurrence of symbol i. Further, 
*f, as assumed here, all m symbols have equal probabilities of 


(9) 


selection ; 
H (x) = 2A flog, (n) bits/sec . 


Thus the problem of calculating H(x) is simple. 
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(4.2) The Uncertainty H(x, y) 


To determine H(x, y) the decoding technique must be known 
in order to calculate the probabilities p(i, /) for i and j between 0 


-and (x — 1), where p(i, j) is the probability that i is transmitted 


and that j is received. The probabilities can be stated under 
four different conditions: 


G) Wheni =j #0 or (n — 1). 


LP ed 1 ko —y2 
pif) = =r exp (Sa jae (10) 
ck 
not (55) (11) 
Gi) When i =j =0 or (n — 1). 
sale 1 (ee) —y2 
Pi, j) = saan oP ($3 ae . (12) 
1 k 
= a + erf (a) | (13) 
(ili) When i = j + mandj #0 or (vn — 1). 
; (|2m| +1)ko 
ae —v 
Pi, j) = ea) exp (57 aw (14) 
(|2m| —1)ke 
wl (|2m| + Ik (|2m| — 1k 
= at ef] 2. | ert | V2 (15) 
(iv) When i =j + m, andj =0 or (n — 1). 
ae a 1 —y 
Pi, j) = eal exp (Sa )a (16) 
(|2m|—Iko 
peg (2m — 1)k 
Sates | me 
n—1,n—1 
Now H (x, y) = 2. PI pi, j) logy p GS) (18) 
i=0, j= 
From which 


aE: aL! 1+ ent(F5) | 


(4.3) The Uncertainty H(y) 


To calculate H(y) the probabilities p’'0)...p’U)... 
p(n — 1) must be computed, where p’(j) is the probability 
that at any instant the received waveform lies within the limits 
(27 -—n-+1+1ko. These probabilities can be obtained in 


two sets, with the knowledge that p’(j) = p(n — 1 — j). 
First, when j = 0 
1 mete ik eae 
——————— S— \Vol 20 
Pd) nov/ (27) x | Be ( Te) 4 2) 
Qr—1)ke 
sh 1“=! (2r — 1k 
eee 21 
i nea | v2 | ca 
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Secondly, whenO0 <j< 
ko (2r+1)ko 


1 aN "yr > (55)a 
(Vac\y SO es See ex Se Nity 28 exp (=~ ns 
Pi) nov/ (27) *P ( 207 rex 20? 
—ko (Qr—ko 
(2r+1)ko 


n—j—1 —4y2 
SS exp (S33) @2) 
r=j+1 20° 
(2r—ko 
From the above, when x is even 
es i (2r — zat 
Hy) = — aaj {1-2 See ort | UE 
(2r 


<i (2 Bee 9) 


ial (2m + 1)k (Qn — 2m —1)k 
+ 5 fer | sa + erf | pa 


: 2n—2 fom 
x oem | a | te ve ae) 


where M is the next integer below (n — D2. 
When n is odd an additional term arises and 


Hy) = #1) — er ert (75 1) toes | ot (75) | 


The expressions for H(y) and H(x, y) are rather complicated 
but simplify considerably for low values of n. The solutions for 
n = 2 and n = 3 will now be considered as special cases. 


(24) 


(5) BINARY P.C.M. 


With binary p.c.m. the quantizing levels are +ko and —ko. 
The probabilities p(i), p(i, /) and p’(/) are easily calculated, thus 


p0)=p) =0-5 
p'(0)=p'l) =0°5 


pO, 0) = pi, 1) = ‘3 L erf (4)| 


oe pO, 1) = p(l, 0) = Ae oes (4)| 
Re ant + erf ee! logs E pet (5) | 
E erf Gai log, E a (|! 


and it is already known, from eqn. (6), that 
C = Af log, (1 + k?) 


Calculations of the communication efficiency R/C have been 
It will 


made for several values of & and are plotted in Fig. 1. 
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Fig. 1.—Variation of communication efficiency with size of 
quantizing steps. 


be seen that the efficiency is a maximum for a signal/noise power 
ratio of about 2 and the value of this maximum is about 76%. 


(6) TERTIARY P.C.M. 


For tertiary p.c.m. the probabilities p(i), p’(j) and p(i, j) are 
still fairly easy to calculate. We have 


pO) = p(l) = p22) = 4 
and probabilities p(i, /) and p’(j) are given in Tables 1 and 2. 
Table 1 
PROBABILITIES p(i, /) 


Table 2 
PROBABILITIES p’(/) 


i 0 


a5 al 3k 
Ph) 3 gt (= 


whence 


ne Sab se (| logy E + erf (5)|4 E — erf (5) log, E —erf ey + erf (5) log, 2 erf (4) | 
& lex (=3) —erf (4) log, E ( ss) erf ( = a [ —erf 6) log, [ —erf (33 | 


= at (S5) toes | 3 art (34) |—3 ic —Sert (35) ]toso| 


(26) 


Pac)) 


whilst C= Afilog, (1 + ) (27) 
Again R/C has been plotted as a function of k, and it will 
be seen that the peak efficiency occurs when the signal/noise 
power ratio is about 0-5 and that the maximum efficiency is 
about 82%. 


(7) GENERAL P.C.M. SYSTEM AT LOW SIGNAL/NOISE RATIOS 


Although the expression for R becomes complicated and 
clumsy when zn is greater than two, it does simplify if k is made 
very small, i.e. if the signal/noise ratio is low. The approxima- 
tions which are useful in these conditions are that when k is 
small 


k 

erf V2 ~ ak (28) 

2 
where et = f= (29) 

qT 

and that when z is small 
2 

log, da +2) ~z— ae (30) 


z 
Inserting these approximations into the expression for R gives 


2Af 


~ nilog,2 


‘It can be demonstrated that the terms in «k are zero; thus when 
ke is very small, ignoring third-order powers of ak and above, 


QA fork? 1 nn —2)% "=2(2r — 1)? 
n log, 2 E 2 a x 2 2) 
DG \ HP 
Eee (2 l)a*k (33) 
6 log, 2 
ine aol J ke? 
From eqn. (6), C= Aflog, E + Veo 
ywhich, when k is very small, becomes 
gun 2A fk jn — ) 
log, 2 6 
R 
. 1 Se) = 
‘whence : im ( =) a 
= ta ‘ (34) 
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Thus the communication efficiency in all p.c.m. systems of 
the type described above is 63-6% at low signal/noise ratios. 


(8) EXTENSION TO SITUATION WHERE THE TRANSMISSION 
PATH DOES NOT HAVE A PERFECT LOW-PASS 
CHARACTERISTIC 

If the transmission system has a pass characteristic which is 
not rectangular, then at the time corresponding to a particular 
received sample there will be a residual error due to the after 

effects of previous transmitted pulses. In order to evaluate H(y) 

and H(x, y) in these circumstances, the distribution of the residual 

error must be calculated for each of the received levels, and must 

be taken into account when calculating the probabilities p’(/) 

and p(i,j). If the impulse response of the transmission system 

is known, such a computation is straightforward, but very 
tedious. 


(9) CONCLUSIONS 


It can be seen that over a wide range of signal/noise ratios 
the communication efficiency of p.c.m. systems can be in the 
range 70-80°%, and that even at very low signal/noise ratios 
the efficiency is greater than 60%. In existing p.c.m. systems, 
without error correction, the signal/noise ratios that are used 
are such that the communication efficiency is less than 33%. 
This suggests the need for the investigation of error-correcting 


D 


we 2 — 
R= {a ths ae)( uk at a + nfl —(n — 20k] Ec Dai ee oF ie | 


5 (31) 


codes which will permit the higher efficiencies quoted above to 
be achieved. 
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THE USE OF EQUIVALENT SECONDARY SOURCES IN THE THEORY OF Croll 
WAVE PROPAGATION OVER AN INHOMOGENEOUS EARTH 


By Z. GODZINSKI. 


(The paper was first received 20th February, and in revised form 3rd Pte 1957. It was published as an INSTITUTION MONOGRAPH 
in April, 1958. 


SUMMARY 


The paper deals with the propagation of a vertically polarized 
ground-wave over an inhomogeneous spherical earth. On the basis 
of approximate boundary conditions and a specially chosen auxiliary 
function in Green’s theorem, the integral equation for the attenuation 
function is derived. A general method of solving this integral equation 
by numerical techniques for paths composed of a number of homo- 
geneous sections is discussed. It is proved that the result is in agree- 
ment with the reciprocity theorem. 

The actual field is shown to be the superposition of the fields 
generated by the primary source and by secondary sources distributed 
along the path. It is demonstrated that the distributed secondary 
sources may be approximately replaced by a number of suitably 
chosen equivalent secondary sources. An approximate method of 
calculation based on this property is introduced. The method is 
discussed in some detail in two cases: when the paths may be regarded 
plane and when they extend into the diffraction zone. The proposed 
method makes possible a simple and rapid calculation of field strength 
in many practical cases. It may be used for any phase characteristic 
of the complex permittivity of the soil. 

The paper confirms the important conclusion that placing the 
transmitting or receiving antenna over a well-conducting soil may 
considerably improve the reception. It is shown further that the 
electrical properties of the earth in the neighbourhood of the trans- 
mitter also have an influence on the phase of the field. 


LIST OF PRINCIPAL SYMBOLS 


E = Electric force, volts/m. 
E = Effective value of normal component of 
electric field intensity, mV/m. 
P’ = Power radiated by a given dipole over a 
perfectly conducting plane; kW. 
Po = 300\/P’ = Primary source. 
D = Length of the path, m or km. 
x = Distance from the transmitter to the con- 
sidered point of the path, m or km. 
Xj~1, X; = Distances from the transmitter to the ends of 
jth section (x9 = 0), m or km. 
a = Earth’s radius, m. 
A = Wavelength, m. 
k = 2n/A = Propagation coefficient in free space, m~!. 
o = Conductivity of the ground, mhos/m. 
<« = Relative permittivity of the ground. 
«’ = Complex relative permittivity of the ground. 
p = Sommerfeld’s numerical distance. 
y(p) = Sommerfeld’s attenuation function for plane 
earth. 
s = Factor transforming geometrical distance into 
numerical distance. 
So = Parameter s for auxiliary homogeneous earth. 
w = Attenuation function. 
Subscripts of w denote the method of 
calculation: 
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t = Theoretical (correct) value. 
ss = Value computed by means of secondary’ 
sources. 
Ty = % — iBo,5 = Parameters in residue series [see eqns. (78), 
(79) and (83)]. 


if 


(1) INTRODUCTION 


The growing importance of modern means of telecommuni- 
cation and radio-navigation has increased the interest of 
theoreticians as well as practical engineers in the problem of 
radio-wave propagation over an inhomogeneous earth. There 
exist at present a considerable number of theoretical investi- 
gations of this problem.'!-2° In some of them comparatively 
simple formulae are presented, but these are based on con- 
siderable simplifications or are valid only in certain special cases. 
No final formulae are available for the general case and the 
computations become very, and sometimes prohibitively, 
laborious. Furutsu!® 11; 13,14 and Wait!® have performed such 
calculations for certain simple mixed-path problems and have 
shown the results in the form of plots of attenuation function. | 
This procedure seems impossible in the general case, however, 
because of the large number of independent parameters: the’ 
lengths of the sections, their electrical characteristics and the 
frequency. There is therefore a need for practical methods of 
computation, sufficiently accurate and general, and at the same 
time rapid in application; the purpose of the paper is to meet this 
need. 

An approximate method of calculating the propagation over 
mixed paths is discussed, according to which the particular parts 
of the path are replaced by certain equivalent secondary sources. 
The physical content of the method is the process of the multiple 
scattering of waves, except that the distributed secondary sources. 
are replaced by appropriate discrete ones. In simpler cases the 
method makes possible a rapid calculation of field strength or 
attenuation function. If complicated paths are considered, 
however, the interaction between all parts of the path obviously. 
increases the laboriousness of the method. For this reason, in! 
another paper the two most important semi-empirical methods, 
namely Millington’s method and the equivalent numerical- 
distance method, will be compared with the theory and dis- 
cussed as to their practical applicability. 


(2) INTEGRAL EQUATION FOR THE ATTENUATION 
FUNCTION 


(2.1) Formulation of the Problem 


The analysis in the present paper is carried out for the case 
in which short vertical transmitting and receiving antennas 
(dipoles) are situated at the points A (transmitter) and B (receiver) 
close to the surface of a horizontally inhomogeneous spherical 
earth of radius a. In the vertical direction the ground is assumed 
homogeneous; the atmosphere is assumed homogeneous, of 
relative permittivity 1. The complex relative permittivity of the 
ground is denoted by e’, and the permittivity and propagation 
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vefficient of free space are respectively denoted by e€, and k. 
The time factor is taken to be exp (iwf). 


2.2) Derivation of the Integral Equation for the Attenuation 
Function 


As for a homogeneous spherical earth, we introduce the 
adial Hertzian vector, rll, and determine the electric and 
aagnetic field intensities in the atmosphere outside the sources 
f the field from the relations 


E=curlcurl (ID; H=iwegcurl(rTD) . . . Wd 


“he vectors E and H satisfy Maxwell’s equations if the scalar 
unction II satisfies the wave equation 


V2 lesa SON hucn (2) 


aqns. (1) may be used to determine the components of E and H 
n spherical co-ordinates; because of the lack of complete axial 
ymmetry the expressions are now not as simple as for the 
,somogeneous earth. This fact is important as regards the con- 
ection between the radial component of electric field intensity 
nd the function II. It is easy to show that the differences 
etween the homogeneous-earth expressions and the present 
mes are generally negligible; in fact, they are connected with 
e change in direction of the wavefront caused by the inhomo- 
reneity of the ground. The most pronounced perturbations 
scur on passing a land-sea boundary, giving rise to the ‘coastal 
iefraction’ phenomenon. As the coastal refraction angles 
sually amount only to a few degrees,?: >» 1% !4 it may be shown 
at the homogeneous-earth expressions for the components of 
© and A constitute generally a very good approximation in the 
ase of the inhomogeneous earth as well. 
‘Of fundamental importance for the proposed theory is the 
troduction of the so-called approximate boundary conditions. 
These are based on the assumption that the wave propagating 
ide the earth has approximately the character of a plane- 
wave. The fact that the electrical properties of the soil are not 
constant in the horizontal direction may change the character 
bf the wave propagating inside the earth and thus change also 
the boundary conditions; this effect will depend on the rate of 
wariation of the earth constants. Consequently the approximate 
ooundary conditions for a homogeneous spherical earth,*!> 2.3.5 
ee 
or Ve 4+ 1) 

Are approximately valid also for the inhomogeneous spherical 
earth, except at places with very rapid changes of ¢’.2> This 
circumstance, important for a spherical as well as for a flat 
earth, has been generally disregarded in existing researches. 
Some investigations of Feinberg? ?»> have shown that in most 
practical cases such simplification is admissible. In the imme- 
diate neighbourhood of a boundary between different sections 
of a path the deviation from eqn. (3) may, however, be con- 
siderable, but the influence they can exert on the field charac- 
teristics still awaits clarification. In the present analysis we 
avill assume boundary conditions as given by eqn. (3). 

Further analysis is carried out on the basis of Green’s theorem, 
ieee 


(3) 


[vet — TV72,)dV = fee. 8) as C)) 


The surface S bounding the volume V is shown in Fig. 1. It is 
formed by the infinitely distant surface S.,, two very small hemi- 
spheres containing the sources of the field ($4) and the point 
! reception (S,), and the surface of the earth except in the 
immediate vicinity of points A and B (S,). For the function v, 
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Fig. 1.—Volume and surfaces of integration in eqn. (4). 


we assume, like II, the scalar Hertzian function for a spherical 
earth, and only for the case when a vertical dipole is situated at 
the point B and the earth is homogeneous with arbitrary electrical 
parameters. Taking into account the singularities of the 
functions I] and v, near the points A and B, applying approxi- 
mate boundary conditions, and using the principle of stationary 
phase, eqn. (4) may be reduced!> to a one-dimensional integral 
equation for the attenuation function: 


 W(x)w,,(D — x) 
| VED aa” 


The path of integration in eqn. (5) is that along the great circle 
joining the points A and B, x and D being the distances measured 
from the point A along this great circle to the point of inte- 
gration and to the point B respectively. The parameter s is the 
factor transforming the geometrical distance, d, into Sommer- 
feld’s numerical distance p, i.e. 


w(D) = w,,(D) — i a 2 a/so)dx . (5) 


k 


Ic Oe +1) (6) 


p=sd;s= 
This parameter changes along the path, i.e. it is a function of x. 
The parameter sq is the one for our auxiliary homogeneous earth; 
it is constant and quite arbitrary. The attenuation functions w 
are taken with respect to twice the field in free space; accordingly, 
at very short distances they approach unity. The arguments in 
parenthesis are the distances, and the subscript sy denotes the 
attenuation function over a homogeneous earth of parameter so. 
Similarly w(x) and w(D) denote those over the inhomogeneous 
earth considered at distances x and D from the transmitter; 
ws,(D) is the attenuation function over the homogeneous earth 
So at a distance D; and w,,(D — x) is that over the homogeneous 
earth sp at a distance D — x, i.e. either when the transmitter is 
at the point B and the receiver at the point x, or when the 
transmitter is at the point x and the receiver at the point B. 
For the path composed of several (n) homogeneous sections, 
eqn. (5) becomes 


“w(e) W59(D — x) 
V/[x(D — x)] 


Xj=1 


WD) = Wa(D) — inf > BY o/s — V'50) 


dx 


(7) 


where x;_1, x; denote the ends of the jth section with the para- 
meter s = Sj. 

Eqn. (7) is the fundamental integral equation of the proposed 
theory. In the plane-earth approximation it gives the integ- 
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ral equation of Feinberg’s theory?»?»> and is equivalent to 
the 2- and 3-section plane-earth formulae of Clemmow,® 7 
Bremmer? and Wait.!®© For the spherical earth also there is a 
good agreement with other theories. It will be shown later that 
for a path composed of very long sections eqn. (7) gives the same 
result as Furutsu obtained previously by a different method.!?-'4 
Wait, in his investigations,!? obtained for 2-section paths thesame 
formula as eqn. (7), and Bremmer very recently”? derived an 
expression essentially equivalent to it. 

The approximations used in the derivation of eqn. (7) are 
twofold. First, there is the adoption of the stationary-phase 
principle, which means inter alia that no great changes of 
electrical earth parameters occur in the direction perpendicular 
to the path within the first few Fresnel zones. In conditions 
where considerable lateral inhomogeneities may be found, devia- 
tions from eqn. (7) must be expected. Secondly, there is the 
approximation which gives, as Fresnel zones, ellipses with a 
major axis equal to D instead of a somewhat greater value. 
This approximation is also to be found in other theoretical 
investigations and may be of some influence for the points of 
the path lying in the immediate vicinity of the boundary between 
two different sections. This is another justification for a more 
thorough investigation of the transition regions. 


(2.3) Evaluation of the Attenuation Function 


The great advantage of the integral equation (7) is the ease 
with which it makes possible the calculation of propagation 
characteristics for every multi-section path. To this end we 
make use of the arbitrariness of sg, which is a direct consequence 
of the auxiliary homogeneous earth previously introduced of 
arbitrary parameter sg; this ingenious method of attack is due 
to Feinberg.” >> 

The attenuation function over the first section is the same as 
over a homogeneous ground with the parameter s=s,. In 
order to compute the attenuation function over the second 
section, 59 = S is assumed. The integral in eqn. (7) then 
vanishes along the second section and the remaining integral can 
be calculated. Having computed in this way the course of the 
attenuation function along the second section, we assume 59 = 533 
thus the integral along the third section vanishes and the 
remaining integrals along the first and second sections can be 
computed, etc. 

In practical calculations a convenient unit of length is 1 km; 
eqn. (6) for s then becomes 


a x 103 
A[600A + ie +1] > © 7 * (8) 


Ss 


where the wavelength A is given in metres and the conductivity 
of the ground o in M.K.S. units; s is then given in (kilometres) ~!. 
Curves of |s| as a function of A for typical soils are plotted in 
Fig. 2. 

The calculation of the attenuation function according to 
eqn. (7) is in principle very simple; in practice, however, the 
computations are generally very laborious. In subsequent 
Sections an approximate method of calculation will be presented 
based on the concept of equivalent secondary sources. Some 
formulae will also be given which have been derived in simpler 
cases directly from egn. (7). 


(3) EQUIVALENT SECONDARY SOURCES 


(3.1) Introduction of Secondary Sources 


The electric field intensity depends on the electric moment of 
the radiating dipole. If, instead, we characterize the source by 
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2.—Variation of |s| with wavelength for typical soils. 
(a) Sea (€ = 80; o = 4-5mhos/m) 


(b) Fresh water (« = 80; o = 2 x 10~3mho/m). 
(c) Wet soil (€ = 20; co = 2 x 10—2mho/m). 

(d) Average soil (€ = 10; o = 2 X 10~3mho/m). 
(e) Dry soil (€ = 2; 6 = 2 x 10—4mho/m), 


Fig. 


the radiated power, we obtain for the normal component of ) 
electric field intensity the well-known formula 


E(D) = 300 VP WD) : (9) © 
In this formula E is the effective value of the field strength in | 
millivolts per metre, D is the distance in kilometres, and P’ is 7 
the power, in kilowatts, which the dipole of a given electric | 
moment would radiate over a perfectly conducting plane. In. i 
eqn. (9) the rapidly varying factor exp (— ikD) has been omitted } 
as it has no influence on the subsequent discussion. To simplify / 


the notation we will call 300\/P’ the primary source po, i.e. 
Po: 300\/ Pi, ee 


(10) § 

Eqn. (9) then becomes E(D) = *°w(D) a1 | 

Multiplying both sides of eqn. (7) by po/D, we obtain 
i n 

E(D) = E(D) — 7, BW — V0) 


x4 


| Ve « = 
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(12) | 


In this formula E,,(D) denotes the field which would be produced — 
by the given sources at a distance D over a homogeneous earth | 
of s = 59, and E(x) denotes the field existing at the point x | 
over the inhomogeneous earth under consideration. Eqn. (12) _ 
may be interpreted as follows: | 

For the calculation of the electric field over an inhomogeneous 
earth we may formally assume the earth to be homogeneous » 
with a certain (arbitrary) parameter s = so. The electric field 
E(D) is then the superposition of the field E,,(D) produced by 
the primary source pp and the fields produced by the secondary 
sources dp distributed along the path given by | 


ap sav s—+/ EW, ice a *) dx “ae 


(13)8) 


Eqn. (12) thus becomes . 
Wa(D = x) (14) 


D 
E(D) = E,(D) + || dp 


The intensity of the secondary sources is not constant but 


epends on the distance D by the factor [x(D — x)/D]!/2. It 
ay be shown!» that this factor is related to the width of the 
area within the few first Fresnel zones, which, according to 


Feinberg,” 3 may be regarded as the effective path of the radio 
nwaves. 


(3.2) Agreement with the Reciprocity Theorem 


One general problem of great importance must be mentioned 
at this point; it is the question whether eqn. (7) satisfies the 
reciprocity theorem. When discussing this problem we may 
(formally assume the earth to be homogeneous of some para- 
eter s = So and calculate the field as the superposition of the 
elds of the primary source and the secondary sources distributed 
along the path. The secondary sources are, however, propor- 
itional to the field intensity at the point considered, and this field 
in turn is the result of the superposition of the fields of the 
primary source and all secondary sources preceding this point. 
‘Consequently, the field at the point of reception represents the 
superposition of the fields, being the result of the interaction of 
all elements of the path in all combinations of successive ele- 
ments; the transmitting antenna is regarded here as the first 
element. Considering one of such combinations of successive 
elements, it is easy to prove!> that the partial field thus produced 
‘satisfies the reciprocity theorem. Consequently, the total field, 
bsing the sum of such partial ones, will also satisfy the 
‘reciprocity theorem. 

The agreement with the reciprocity theorem is of great 
importance, not only because it ensures that in the course of the 
janalysis no inadmissible simplifications have been made, but 
jaiso since it makes it possible during the calculation to place 
jthe primary source at this end of the path, which is more con- 
venient as regards the errors or the laboriousness of computations. 

. The above discussion is important also in another respect, i.e. it 
has shown that mixed-path propagation is in fact a process of 
multiple scattering. 


(3.3) Introduction of Equivalent Secondary Sources 
As mentioned before, secondary sources dp depend on distance; 
lit proves convenient to introduce secondary sources dp.., cal- 
culated for D = 00 and therefore independent of D: 


i 


tds = — -(y/s o/s JEN dss. dp = de 0) 


V 
Eqn. (14) then becomes 
p D—x\ w,,(D — x) f 
EXD) = E; 4D) + oe = (16) 
(D) = E,,(D) J doa ( >) 


Let us consider the partial field Ej,g, produced by secondary 
sources placed over the section d; < x < d); it is given by 


(17) 


do 
DE NYA 3) 
Ena, ~ | don/( D D—x — 


‘For simplicity we put 
/ DN WiD as) 
VV ( D D—x 


‘Expanding F(x) in a Taylor series about a certain point x = G 
vwe obtain 


F(x) 


d2 dz 
Esa = F(O)| don + FO) Kc — b)dpe 


dz 
+4F"O | @— dro +... - (18) 
aq 
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For not too large distances (x — &) the third term in egn. (18) 
is very small. If we demand that the second term be equal to 
zero, we obtain 


(19) 


Because of the complex character of dp,,, eqn. (19) in general 
cannot be satisfied; consequently we modify it and demand 
that* 


dz 
| lsaeel 
f= err Be (20) 
| ee | 
dy 
Denoting 
d> 
2 D 2 & 
Pdidoy = ie ADo; Paya Sane (21) 
1 
we obtain finally 
W,(D'— &) 1B Ws,(D — &) 
Eid: = Pads py —— = Padtdoey v( D 5) D a é (22) 


The result just obtained may also be formulated in terms of the 
attenuation function. From eqns. (22) and (11) we may regard 
the attenuation function w(D) as the superposition of the contri- 
butions from the primary source and the secondary sources. 
The contribution from the part of the path considered is then 


D Pad D 
vate = Ende = Ee |(5? wD - 8 23) 

Eqn. (23) states that the secondary sources distributed over 
the section of the path d, < x < d, may be replaced by a single 
equivalent secondary source pgjq2 placed at their ‘centre of 
gravity’ at the point x = €. The site of this equivalent secondary 
source does not depend on the distance D; its magnitude does 
depend, however, on distance in the same manner as do the 
elementary secondary sources dp. 

An important property of the equivalent secondary sources is 
the ease with which they can be calculated for any value of 5p. 
In fact, if, for a certain sy = so, they are equal to p’, then accord- 
ing to eqns. (15) and (21) their value p” for another sg = sq’ is 


given by ‘ 
pe _ v/s = V0 5) : 
V's — v/'So 


The introduction of equivalent secondary sources is important 
not only because it leads to an approximate method of com- 
putation but also because it helps to provide a clearer picture of 
the phenomena occurring. We will illustrate this by one general 
conclusion regarding the errors of computation. In some cases 
the primary source and the secondary sources may cancel one 
another almost completely. In such cases the calculations—by 
all methods—must be performed especially carefully if excessive 
errors are to be avoided. 


(24) 


(4) INFLUENCE ON THE FIELD OF A SMALL 
INHOMOGENEITY OF THE PATH 


The investigation of the change of the field due to an inhomo- 
geneity on an otherwise homogeneous path can give a valuable 
insight into the mechanism of the influence of particular parts 


* The consideration of the way in which the contributions from the elements of 
the path are added together to form the field Ea,q) has led to the conclusion that it 
would be more suitable to accept eqn. (20) than to assume for simply the modulus of 
the right-hand side of eqn. (19). 
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of the path on the field. The analysis can be made very simple 
under the assumption that the inhomogeneity is constituted by a 
very short section of length dx, placed at the point x and 
characterized by the parameter s’ different from s for the rest 
of the path. The ‘primary’ field which would exist at the point 
of reception over the homogeneous path s would be equal to 
Ear = Bw, CD) (25) 
In order to compute the field in the presence of the inhomo- 
geneity, we assume sy = s and sum the fields from the primary 
source py and the secondary source dp. The change of the 
field dE with respect to the primary field E,, will be produced 
by the secondary source dp: 


ae w,(D — x) 
—x 


dE=5 


Bele VEO. | helena (D — x)dx . (26) 
Jt D(D =), : “heh 


The assumption of a very small length of the section s’ enables 
us to take for the field E(x) approximately the value which 


would exist at the point x on a homogeneous path s. The 
relative change of the field is thus 

dE i } 4/D w(x)w,(D — x) 

— Ss s)— obe (7 
Be an Ye] | AD) a 


Eqn. (27) shows clearly that the influence of the inhomogeneity 
depends strongly on its location over the path. It is greatest 
for small distances from the transmitter or receiver, and least for 
locations in the middle of the path. 

As an example we will consider eqn. (27) for not too great 
distances, i.e. we will use the plane-earth approximation; 
eqn. (27) then becomes 


dE _Vs—vs' Ae »(Z0)>] (1 5 | 
fe /s von | 2(1 oe 2) 
ed SaaS 
See seid) 


where p = sD. The function 7 has been computed for a few 
values of p; the corresponding curves are shown in Fig. 3. For 


d| > 


(28) 


x/D 


Fig. 3.—Curves of the function |7| from eqn. (28). 


(a) |p] = 1. () |e| = (c) |e| = 100. 
v= 0°; = 1909: _ t aie 
v = arc tan 600K" 
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small numerical distances the influence of the inhomogeneity is, 
also very small, since the attenuation of the wave is then insigni- 
ficant. As p increases, this influence increases at first and then 
begins to diminish rapidly. 

The results given here are valid in principle only for infinitely 
small inhomogeneities. They present, however, a possibility of 
ascertaining in practical cases whether an exact calculation of | 
the influence of a given inhomogeneity is indispensable, or if it | 
is sufficient to estimate it, or, finally, if it may even be completely 
disregarded. These results are also of great importance as 
regards the understanding of the phenomena occurring. 


(5) CALCULATION OF SHORT MIXED PATHS IN THE 
PLANE-EARTH APPROXIMATION BY MEANS OF 
EQUIVALENT SECONDARY SOURCES 


(5.1) General Formulae 
Assuming a plane earth, eqn. (7) takes the form 


eCOvitolD — x)] 
vie =o) 


w(D) = y(spD) — ine 23) (V/s; — V’S0) 


(29) | 


where y is Sommerfeld’s attenuation function for a plane earth, } 
and sD, so(D — x) are numerical distances [see eqn. (6)]. 

We will consider first the equivalent secondary sources over | 
the first section of the path for the part lying between the points |) 
d, and dy (where d; <x <d,). According to eqns. (11), (15) | 
and (21), 


A da 7 
Pasty = — GalVs—Vsdeo] “de. . GOI 


Denoting the numerical distance by p and using the relation 5 
i| C1 me ‘ Pin a 
4 rep" dp = —i(/7) — > YOU ee . BI 
we introduce a new function v(p), where 


Pap A — 


(py) = = as a 


Eqn. (30) then takes the en 


Padre = po Ysa — v(sd;)] 


Polar diagrams of the function v(p) for five phase angles of the | 
parameter s are shown in Fig. 4; they have been computed by | 
means of Karpov’s tables.?3 The asymptotic formulae for v(p) | 
are: 


For small p (|p| < 1), 


oy? 4 
v(p) 7 Nee p+ nae os (34) ; 
For large p (|p| > 10), 
| 
eran G9} 


In order to compute the position of the centre of gravityé of | 
the secondary sources, we use eqn. (20), which gives 


fi | pdv(p)| -fi |pde(p)| 


fuel [~(p2) — ea 


where p; = sd, and pz = sd. 


(36) | 
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1 
Fig. 4.—Polar diagrams of function v(p) for the parameter w = arc tan = : 
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Fig. 5—| |pdv(p)| = (|p|) for the parameter % = arc tan 60GA" 
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The integrals occurring in eqn. (36) have been computed for 
three phase angles of the parameter s; corresponding curves are 
shown in Fig. 5. For very large and very small numerical 
distances eqn. (36) can be evaluated analytically, giving 


For very large p, Ex V/add) (37) 
For very small p, E~ Yd, +V/(d,d) + ad] (38) 
For d, = 0, d, = d, eqn. (38) gives 

Ex ld (39) 


In Fig. 6 are shown curves of €/d for the section at the antenna 
(d; = 0, d,=d). It follows from them that the larger the 
numerical distance p = sd, the smaller the ratio Eld. 


5 10°2 510 2 5102 510 


lp| 
for section at the antenna; 


5 10'2 


5 1072 


Fig. 7.—Relative differences between the arithmetic mean and values 
calculated according to eqns. (37) and (38). 
ee 3, + do) — $Id) + V(did2) + do). is 3(d, + do) — V/(did2) 
d2—a d2— a 


Where the ratio d,/d, is not too large, we may take for é 
instead of eqns. (37) and (38) simply the arithmetical mean, 
€ = 4(d, + d)); the relative deviations from the correct values 
are shown in Fig. 7. 

Where the first section of the path is sea, we may often assume 
it to be perfectly conducting, which gives s=0. By virtue of 
expansion (34), eqn. (33) then takes the form 


a) 

Ridge! ie: PolV (5942) — +/(so4y)] (40) 
For the calculation of € we use in such cases eqns. (38) and (39). 

The equivalent secondary sources over the farther sections of 
the path may be calculated in an essentially similar way as for 
the first section.!> Because of the practical applications we will 
consider only the equivalent secondary sources for the second 
section of the path when it is relatively short. Analysing this 
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problem we make use of the reciprocity theorem and of the 
results just obtained for the first section of the path. After an 
easy but somewhat lengthy calculation it may be shown!> that 
the equivalent secondary sources p2.. for a short second section |} 
are given by 


Pra = Phos + Poin (41) | 
whete: “ip? pov 2 Mos.) —v(5,x)] . . Gq 
1 


aby = 
Be Po 


4/8. — V'So 4/82 — a/'54 ylsi(E> = &)] uls Ch. x] 
4/85 /'S2 v/[sx(Eo — &)] ~ 7 i 
= (43) 


The notations used in eqns. (42) and (43) are partly illustrated i 
by Fig. 8. &, denotes the position of the centre of gravity of } 


@) 


Fig. 8.—Notations used in eqns. (42) and (43). 


secondary sources over the second section of the path. An . 
examination of eqns. (20) and (41) shows that for practical | 
purposes the centre of gravity may be placed in the middle of | 
the section, i.e. ! 


OM Ede ae 85) (44) | 


€5 denotes the position of the centre of gravity of the first section -K 
of a path, which would be created by placing the transmitting | 
antenna at the point €, (see Fig. 8). In other words, £5 is the | 
position of the centre of gravity of the first section, having the z 
length +(x. — x,) and parameter s = sy. . 

In eqn. (43) there has been introduced also a new function, | 
u(p), defined by 


Ee 2 


= PL | 
u(p;) = — we | vp dp = | men + Pr ae sve | - (45) | 


The polar diagrams of the function u(p) for five phase angles of | 
the parameter s are shown in Figs. 9 and 10. The asymptotic 
formulae for u(p) are as follows: 


For small p (|p| < 1), 


Anas i 8 
Sa eS ee ee 52 . 
u(p) pela tate Ja? ae hed (46) 
For large p (|p| > 10), 
i Dy i ; 
u(p) ~ Ate VP oy (47) 


In cases of land-sea paths we may often assume s — 0 for sea 
sections; eqns. (42) and (43) then take the form: 


(a) 5s, = 0 (first section sea, second section land) Si 
a? 
(De ETP OY #3 — VS S\/x2 — x1) (48) 
PP, ee Ae? = V5 ulso(x, — x1)] 


V'S2 V [s2(€. — &5)] 
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e+1 


Fig. 9.—Polar diagrams of function u(p) for smal] numerical distances; y% = arc tan —— 


(b) s, = 0 (first section land, second section sea) 


ma po(/2) [v(six2) — v(s4x,)] 


Oy = 
PS», = 


(50) 


= a4 
. = on eee bing ~e)) . Gn 


(5.2) Two-Section Paths 


The results so far obtained make possible a simple calculation 
of the field in some important mixed-path problems. As a first 
»sxample we will consider a 2-section path with a comparatively 
short first section (Fig. 11). Since, according to the reciprocity 
itheorem, we may interchange transmitter and receiver, the above 
assumption means that we discuss, in fact, the more general 
yoroblem of a 2-section path with one section short as compared 
with the other. We assume so = s, and replace the first section 

y one equivalent secondary source p;, placed at the point &,. 
‘From eqns. (33) and (21), 


= Di 
Dis po 1525.80) es (52) 
[The field strength E(D) at the point B is then 
a y(5,D) y[sxD — E,)I 
1E(D) = Po ee a DRE, 
D 
= Phas jD)) == Rae -ulsrxdq/ (5 ae )y[s@ 7 én} 
(53) 
and the attenuation function for the path is 
— A/S D 
AD) = y(syD) + Vist Vous.) | (Gag) tn@ - O01 
Re ee pes yi (S4) 
i x < D, we may consider &, ~ 0; then 
w(D) © vob) se ai | Lae 555 
V5 


60cA° 


Fig. 10.—Polar diagrams of function u(p) for large numerical distances ; 


e+1 
¢% = arc tan SOaA’ 


Fig. 11.—Two-section path with a comparatively short first section. 


An example is given in Fig. 12, which shows clearly that a 
considerable increase of field strength can be achieved by suitably 
locating the transmitting antenna. 

On the basis of the reciprocity theorem we may also use 
eqn. (55) when computing the field on the second section at 
points near the boundary of the first section. After an obvious 
change of notation, we obtain for the path (from Fig. 13) 


w(D) ~ ,D) + vi Ve 5 | 


52 


(56) 
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Fig. 12.—Increase of the field strength due to the presence of a well- 
conducting section near the transmitting antenna. 
Diagram of the factor [i + Vist — V¥n(5x1)] from eqn. (55). Parameters s; and 
v 


s2 real; s} <s2. Vectors (i), (ii) and (iii) correspond successively to an increasing 
length of a well-conducting section. 


Numiber of secondary sources (on sea) 


w(D)ss/w(D)t 


Gln) 18 


Fig. 14.—Field changes on passing the boundary. 
Vs2—- VS) 


Diagrams of the factor | 1 + v(s28)] from eqn. (56). Parameters s; and 


so real. Vectors (i), (ii) and (iii) correspond successively to an increasing distance 
from the boundary. 


(@) so > 81. (b) s2 < 84. (c) sz = 0. 


3 is the ye from the equivalent secondary source to the transmitter; s7, ss; are the parameters 
si exp ( — ify]. 


s for land and sea sections, respectively; ss = 0; yj is the phase angle of s; [s; 


Some examples of field changes on passing the boundary of the 
first section are shown in Fig. 14. It is interesting to note that 
in the transition to a much better-conducting ground a con- 
siderable increase of field strength can take place; this is a well- 
known ‘recovery effect’. 

So far we have discussed the field intensity; it is interesting, 
however, to consider also the phase of the wave. According to 
eqn. (55) and Fig. 12, the properties of the ground near the 
transmitting antenna will influence not only the amplitude of the 
field but also its phase. This can perhaps give some explanation 
of the recent experiments of Pressey, Ashwell and Fowler on 
radio aids to navigation.?* 25 

For similar reasons the phase of the wave changes after 
transition to a differently conducting ground. Eqn. (56) and 
Fig. 14 show that, on passing to a better-conducting soil, the 
phase velocity of the wave should increase; on passing the 
boundary in the reverse direction the phase velocity should 
diminish. This conclusion is essentially in agreement with the 
results of measurements made by Pressey, Ashwell and 
Fowler.2® 24; 25 

Where the sections of a 2-section path have comparable 
lengths, it may be necessary to use more than one equivalent 
secondary source. How many secondary sources are to be used 
depends on the ratio between the geometrical lengths of the 
sections, on the numerical distances they represent, and finally 
on the degree of inhomogeneity of the path. The first circum- 
stance is obvious. The second arises from the fact that for 
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Table 1 
Ratio w(D),;/w(D), FOR A NUMBER OF LAND-SEA PATHS 


nd of the Jand section, respectively; 


0-16.07 — 45° 
0-145 7 — 47° 
0-125 7 — 49° 


D, ly are the lengths of the whole path a 


k 
K 
i 
1 
i 


G 


] 
1 
1 
\ 


THEORY OF GROUND-WAVE PROPAGATION OVER AN INHOMOGENEOUS EARTH 


great numerical distances the v(p) curve (Fig. 4) shows marked 
curvature and consequently can no longer be approximated bya 
single straight line. The last circumstance applies in connection 
with the intensity of the. secondary sources. According to 
eqn. (13) the secondary sources are proportional to the 
difference (\/s — 4/59). Consequently, where this difference is 
small, the intensity of the secondary sources is also small and 
may be calculated less precisely. However, when this difference 
is large, e.g. for land-sea paths, the influence of the secondary 
sources is great and a more elaborate procedure must be adopted. 
. Because of the great variety of parameters (lengths of the 
sections and magnitudes and phase angles of the parameters S), 
it is not possible to give any general rule as to the choice of the 
number of equivalent secondary sources. In order to obtain 
some indication, several land-sea paths have been considered; 
the results of the calculations are summarized in Table 1. Here 
w(D),, denotes the value of the attenuation function calculated 
by means of the secondary sources, and w(D), the theoretical 
(correct) value computed in the cases considered by numerical 


integration of eqn. (7); this notation will also be used in the 


{ 


] 


subsequent analysis. As mentioned before, the inhomogeneity 
of the ground is most pronounced for land-sea paths, and the 
examples given in Table 1 may therefore be regarded as being 


| iM some sense extreme. In Table 2 are given the results of cal- 


Table 2 


_ RATIO w(D),,/w(D), FOR SOME PATHS OF VARYING INHOMOGENEITY 


wW(D)ss/w(D)e 


Secondary source on section 
S1 52 


0:90 / — 
0:86 / 5° 
0:92 /.9° 
1:01 /6° 


147 —s 
1-36 / — 16° 
me 
1200)/=19° 


0-15 7 —101° 


0-115 / — 152° 
0-091 / — 168° 
0-065 / — 178° 


| 


11, 12, 51, Sz are the lengths and parameters s of the first and 
second sections, respectively; J; = /2; 5, and s2 are real; s4/; 
+ so9ly = 10. When computing w(D),;; one secondary source 
has been used. 


culations for some paths of varying degrees of inhomogeneity, 
but of constant overall numerical distance. Table 2 shows 
clearly that, as expected, the errors of the method rapidly 
diminish when the inhomogeneity of the path becomes small. 
For land-sea paths with a land section representing a large 
numerical distance (|s;/| > 1), the method of equivalent 
secondary sources could be checked against the formula derived 


by Feinberg,?* 
1 2i Ri Di, | 
as gee — D ; 
wale u /7 (s l; ) 


The results of calculations have shown that only for x; < D 
“nay the first section of the path be replaced by a single secondary 
source. If x, is not very much smaller than D, two or three 
secondary sources must be used. When secondary sources are 
placed on the sea, the sea section may be replaced by two 
sources even if it constitutes about 80% of the whole path; in 
“uch an extreme case the errors amount to about 10%. When 
secondary sources are placed on land, the errors are much 
-igger and a finer subdivision of the land section is necessary. 


(57) 


w(D) = 
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Some indication as to the proper division could be obtained 
from the calculations of the attenuation function for land-sea 
paths with land and sea sections of equal length. The most 
difficult case has proved to be when the parameter s, was real 
and s,J; ~ 50; it was then necessary to use three secondary 
sources in order to reduce the errors to about 10%. When s, 
was real and s;,/; was very great (say, some hundreds), and also 
for imaginary s, and any great value of |s,/)|, two secondary 
sources could be used and the errors did not exceed about 5%. 

The explanation of this behaviour seems obvious: the errors 
depend upon the curvature of the v(p)-curve, the distribution of 
the secondary sources along the path and the changes of the 
attenuation function with distance. Consequently, with growing 
numerical distance, a more elaborate procedure must be adopted, 
unless the numerical distance is so great that the influence of 
the parts of the section lying far from the transmitter may be 
disregarded. 

Essential for success is the way of subdividing a section into 
parts. Useful experience has been obtained with the division 
as shown in Table 3. 


Table 3 
THE DIVISION OF THE FirsT SECTION OF A PATH INTO PARTS 


Two parts Three parts 


xD [xt Zee zed bo 


Land 
|s7j| = 


i> 
10 


0-1/0-15 
0:06 


50 
very great 


Sea (si 0) 


O23 


: x? are the distances from the transmitter to the 


end of the first section and to the end of the first and second 
parts of the first section, respectively; J) is the length of the 
land section. 


Interesting results are obtained when both sections of the 
path represent very great numerical distances [|s4x4| = 1; 
|s.(D — x,)| > 1]. As mentioned before, the first section of 
such a path can be replaced by one secondary source only when 
the first section is comparatively short; in the general case more 
secondary sources, placed at appropriate points &, &?,... 
are necessary. Nevertheless, we will replace the first section by 
one secondary source p; which we will situate at the same point 
as the transmitting antenna. From eqn. (33), 

Ste Ue x1) 
VS1 


For large numerical distances we may, from eqn. (35), make 
v(s;x,;) x — 1, which means another approximation, sometimes 
also quite rough. Instead of pp we now have the effective source 


Def, given by 


Pp Po (58) 


Ss 
Pes = Bo + Pi = Por|? (59) 
The field at the point of reception is then 
— Pef Po =i 60 


since y(s,D) ~ — 1/(2s,D). This gives, for the attenuation 


function, 
j 


= 61 
24/ (515 )D ou 


w(D) 
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which is independent of the lengths of particular sections. The 
same result has been obtained by other methods by Feinberg* > 
and Bremmer.? 

The argument used is apparently independent of the numerical 
distance of the second section of the path; it therefore seems 
possible to assume s, = 0. Proceeding as before, and taking 
for v(s,x;) only a better approximation, 


i 
/(a751X1) 
Dp lee 

a/ (77841) 


(62) 


(54x) = 1 


we obtain finally w(D) = (63) 


Comparison of eqn. (63) with the rigorous formula (57) shows 
that the result obtained is correct only in the case of com- 
paratively short land sections, representing very great numerical 
distances. 

It may be instructive to explain why eqn. (61) gives com- 
paratively small errors while eqn. (63) can be sometimes of no 
practical value. As follows from the method of deriving 
eqn. (22), the concept of equivalent secondary sources is 
especially suitable when the length of the section is small com- 
pared with the distance to the receiver. In the cases just con- 
sidered this condition is fulfilled only when the first section is 
comparatively short (x; < D); the method of equivalent 
secondary sources then gives strictly correct results. When the 
lengths of both sections are comparable, the secondary sources 
on the parts of the first section near the boundary are much 
nearer the receiver and have therefore a much stronger influence 
on the field than when x; < D. However, if in these cases also 
we use the same procedure as before, the errors of the method 
must increase. If the second section is sea (s, = 0), then 
P, ~ — Po and the secondary sources cancel almost completely 
the primary source pp; the errors which are introduced when 
replacing the distributed secondary sources by a single equivalent 
secondary source are then the most pronounced. If the second 
section is land, the cancellation is not so complete and the 
relative errors are then much smaller. 

Because of the practical importance of eqn. (61), we will 
discuss it now from a somewhat different point of view. As 
mentioned before, eqn. (61) states that the attenuation function 


to eqn. (7) we obtain 


In Tye 
Aw(D) 1 ~ infos — vs) y(s,x)y[ so OV ix I (64) | 


V[x(D — x)] 


For attenuation functions y(s;x) and y[s,(D — x)] we use the | 


asymptotic expressions valid for large numerical distances 
1 


i 
Wsx) = — 2s,x° y[s2(D ai x)] ae 2s(D — x) . (65) 


which give 


Aw(D) = 5 ae 2h — D D— 2 i 


n/n 3453327 LW/[D =a A on 


Aw(D) with w(D) as given by eqn. (61); we then obtain 
Aw(D) _ yi 4/84 — V'82 D—2i, 
wD) Va v(s182D), V[h(D —1)] © 

Eqn. (67) illustrates the influence on the field of the middle parts 

of a path composed of sections representing great numerical 


distances. It also gives a useful indication as to the errors of 
eqn. (61). 


(5.3) Three-Section Paths 


The general problem of a 3-section path may be solved in © 


two ways: by assuming either sp = 53 or so = Sz. In the first 


case the secondary sources over the first and second sections of | 
the path must be calculated, and this is in principle always | 
possible.!> In practice, however, computation of the secondary — 
sources over the second section is fairly laborious except when © 
the second section is comparatively short. This is why such a 


case only was previously considered [eqns. (41)-(51)]. 


According to Fig. 3 the influence of the middle parts of the . 


path is least when the path represents a great numerical distance. 


If we consider two paths, land-sea—land and sea—land-sea, the © 


second section of the path will have a much smaller influence 
in the first case than in the second case. Consequently, if we 


replace the second section by one secondary source, the errors — 


Table 4 


THE Ratio w(D),;/w(D), FOR CERTAIN LAND-SEA 3-SECTION PATHS 


52 


km-—1 


w(D)ss : w(D): 


> (land) \ Ce 


\ 1 O2007 9a soe 
(land) 0-013 1 7 —184° 


h es } (land) 


1 0 0-47 /—11° 
(sea) 0-336 7 —12° 


All parameters s are real. The second section of the path is replaced by one secondary source. X1, X2, D are the 
distances from the transmitter to the end of the first, second and third sections, respectively. 


is independent of the lengths of particular sections; this can 
constitute surely only an approximation. In order to explain 
this we will consider a path of length D, composed of two sec- 
tions (with parameters s; and sy) representing great numerical 
distances. We will consider two cases, namely when the first 
section has the lengths /; and /,, and will calculate the difference 


will be much smaller and the applicability of the method much 
wider for land-sea-land paths than for sea—land-sea paths. 
Table 4 confirms these conclusions. It also gives indications as 


to the limits of application of the method discussed. The 


examples in Table 4 have been chosen so as to make possible 
the comparison with two approximate formulae derived by 


Aw(D) of the corresponding attenuation functions. According | 


(66). 


To simplify the formulae we assume 1, = D — 1; and compare | 


(67) | 
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Feinberg.?-° In the cases considered the errors of Feinberg’s 
formulae [w(D), in Table 4] should probably not exceed a few 
‘per cent. 

In the second case (sp = 5), the first and third sections of 
the path must be replaced by suitable secondary sources. 
Another procedure, however, is also possible. If we replace 
the first section by a number of secondary sources p\!’, Piste. 
we may then formally regard the path as consisting of two 
sections with parameters s, and s; and compute the field as the 
superposition of the fields of all the sources po, pi, p?, . 
These partial fields we can calculate according to the reciprocity 
theorem, i.e. by placing all these sources at B and calculating 
the fields at points A, €4!, €2,... (Fig. 15). We therefore 


G) @) @) 
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Li glow VILIN % 6%, 
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Fig. 15.—Computation of attenuation function for 3-section paths 
when so = S52. 


replace the third section of the path by suitable secondary 
sources, aS was done in the corresponding 2-section problem. 
The procedure described is in principle quite general. It is 
especially suitable when the number of secondary sources used 
is small. This occurs either when the first and third sections are 
comparatively short, or when they represent very great numerical 
distances. 

We will first consider the case when the first and third sections 
are comparatively very short, i.e. when x, < Dand D — x, < D. 
According to eqn. (33), we replace the first section by an equi- 
valent secondary source p;, which we place at A (see Fig. 15): 


4/8, — v/s 
Pi = Po Cee 20(51X}) (68) 
At A we have now an effective source p,r, given by 
} s, —v/s 
Pea Dot P= Po| | sf Vo era (69) 
V 51 


To compute the field at B we use the reciprocity theorem, i.e. 
we place p;, at B and calculate the field at A. The third section 
we replace by an equivalent secondary source placed at B, and 
so obtain at B an effective source p,;, given by 


Def = Pol | + var Vote(s) | Si VEY eyfotD s xa} 
(70) 


This gives, for the attenuation function over the path con- 
sidered, the final expression 


w(D) = vioaD)| | + Vee) | 


53 — 0/8 

{1 en SEY 22,1) = ~} 
V3 

We will now consider the condition when all three sections of 

tie path represent very great numerical distances. According 

> the discussion in Section 5.2 we may, in this case also, replace 


VoL. 105, Part C. 


(71) 
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the first and third sections approximately by single secondary 
sources, placed at A and B. Putting in eqn. (71) v(s,x,;) ~ — 1, 
v[s3(D — x2)] ~ — 1 and y(s,D) = — 1/(2s,D), we obtain 


1 
2y/(s153)D 


Like eqn. (61) this formula has also been derived by other 
methods by Feinberg?» ° and Bremmer.? 

From eqn. (72) it is seen that the field does not depend on the 
length of the second section of the path. As follows from the 
comparison of eqn. (72) with eqn. (61), the attenuation function 
would be the same even in the absence of the second section. 
This means that the influence of the secondary sources from the 
second section of the path has been completely disregarded; 
this has, in fact, resulted from the assumptions v(s,x,;) = — 1 
and 2[s;(D — x,)] = — 1. 

Summarizing the discussion of the 3-section problem, we see 
that in all cases the attenuation function may be computed by 
means of equivalent secondary sources; in some cases the cal- 
culations may be, however, somewhat laborious. Of great 
advantage for the calculations is the fact that the middle parts 
of the path have a comparatively small influence on the field. 
Thus, the secondary sources over those parts of the path need 
not be calculated especially precisely. 


w(D) = (72) 


(5.4) Multi-Section Paths 


The secondary sources may in principle be used also for the 
computation of the attenuation function for multi-section 
paths.!5 The calculation then becomes laborious and some- 
what complicated and generally offers no serious advantage as 
compared with the numerical integration of eqn. (7). In two 
special cases, however, a simple and interesting discussion is 
possible. 

As the first example we will consider an overland path com- 
posed of 7 sections, of which the first and the nth represent great 
numerical distances. The middle parts of such a path have 
only a small influence on the field; accordingly we may approxi- 
mately replace the sections 2, 3, ..., m — 1 by a single homo- 
geneous section of some appropriately chosen parameter s,,. 
From eqn. (72) the attenuation function is then 


1 
* 2/(515,)D 


which represents a direct generalization of eqns. (61) and (72). 
The parameter s,~, has dropped out of the final formula (73); 
this means, however, that we have in fact completely disregarded 
the influence of all sections of the path except the first and the 
last. This conclusion is supported also by comparison of 
eqn. (73) with eqn. (61) for 2-section paths. 

As the second example we will consider a path composed of n 
sections, of which the last one of length 6 is comparatively very 
short. We will assume sy = s,_; and replace the sections 
1,2,...,2 —2 by appropriate equivalent secondary sources. 
We have thus transformed our problem into a 2-section one 
with a number of sources of radiation; the field at the point of 
reception we find as the superposition of fields of the primary 
source and of all secondary sources. Further analysis is quite 
similar to the case of eqn. (56): the presence of a short section s,, 
may be taken into account by introducing a factor 


3 V Sn EE li(s,,0) 
V Sn 


As this factor appears in the formulae for all partial fields, it 
may consequently be used also for the total field. The attenua- 
16 


w(D) ~ (73) 


ie: 
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tion function for the case considered, i.e. for the points near the 
boundary, thus takes the form 


w(D) = ¥ Doni | ee ee . (74) 


where w(D),,-ss,_, denotes the attenuation function when the 
section n — 1 would reach to the end of the path, i.e. when s, 
would be equal to s,_,. On the basis of the reciprocity theorem, 
eqn. (74) may also be used, with obvious changes in notation, 
in the cases when the first section of the path, i.e. the section 
lying close to the transmitter, is very short. This result could 
also be derived directly by replacing the first section by an 
appropriate secondary source and calculating p,, similarly as 
in eqn. (69). The analysis in this case must be, however, more 
careful, because it is necessary to discuss the influence of the 
secondary sources from the parts of the second section near the 
boundary with the first section. As we have already obtained 
the desired result, we will not consider the problem any more 
from this point of view. 

Eqn. (74) represents a generalization of the results obtained 
in Section 5.2 for 2-section paths. The limits of application of 
eqn. (74), as well as its errors, can be estimated on the basis of 
the remarks in Section 5.2. We thus obtain the condition that 
the length 5 should be much less than the distance from the 
point of reception to each of the sources. In practice, however, 
near but weak sources often need not be taken into account, 
and this means a wider application of eqn. (74). The amplitude 
and phase changes of the wave on passing the boundary are 
discussed in Section 5.2. 


(6) LONG MIXED PATHS 

(6.1) General Considerations 
For longer distances the plane-earth approximation is not 
satisfactory and the influence of earth curvature must be taken 


into account. The distances d up to which the earth may be 
considered plane are as follows: 


Reference 27: d< 12A)13 , (75a) 
Reference 28: d= S(AUGs (75b) 
Refezence 29: d< TA! . (75c) 


In these formulae d is in kilometres and A in metres. 
For a homogeneous spherical earth we have at present three 
formulae?®: °° 2° for the attenuation function w(D): 


For small numerical distances p, 
wD) = 1 — i(h/7)p'!? — 2p + iy/m)(1 — 483) p3/? 


+ (1 — 8p? — Za —ga9p9? —... 76 
For any numerical distances p, 
w(D) = y(p) + 43[(1 + 2p)y() — 1 + iG/m)p!?]83 
+ [Gp? — Dy(p) — id/m)p'71 — p) 
+t —2p + ep*|oo--s -. (77) 
The residue series 
Qi 1/2 = exp [— it,(ka)'"/3D/a] 
= ae 1/6 q 
wD) = D'2(— = ) a) 2 ae ey 
é (ka)tIé 
where o= ‘(kay I3(e! = 112 a (2as)!/2 exp (— iim) (79) 


and 7, denotes certain complex values depending on 6. 
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The series (76) is useful for small numerical distances, as then 
only a few terms of it need be considered; in such cases eqn. (76) 
differs, however, insignificantly from the corresponding plane- 
earth formula. In subsequent analysis we will therefore not 
take eqn. (76) into account. Similarly we will disregard eqn. (77) 
because of its very complicated form. 

As regards the residue series, it may be easily used in mixed- 
path problems because of its convenient analytical form; Wait’s 
research!? of 2-section paths may serve here as an example. 
However, in a general case, when many terms of the series must 
be used, the calculations become laborious because of the many 
complex parameters appearing in the formulae. In the subse- 
quent analysis we will discuss explicitly only those cases in 
which it is sufficient to consider merely the first term of the 
residue series, i.c. when the distances are so great that the 
receiver is in the diffraction zone. According to Millington?! 
these distances are ‘ 

For errors < 3dB 


d> 20(A)1/3 (80a) 


For errors < 1dB d> 30(A)'/3 (805) 


where, again, d is in kilometres and A in metres. In such cases 


w(D) = »/(D)C exp (tD) (81) 
with aig Ema A (82), 
7 oe Ogee IDae (83) 
t = — if(ka)"3/a]ry = — ik(ka)—23 79 
= k(ka)-213(— icy — Bo). (84) 


Values of a and py have been given by Norton.?’ 


(6.2) Two-Section Paths 


 eeee———eEEeEeEeEeEeeEeEeEeEee 


' 


: 
. 


As follows from the remarks in Section 6.1, the general case 
of a 2-section path with arbitrary lengths of the sections will 


present very great analytical difficulties. 


A comparatively — 


simple analysis is possible in two instances: first, when one 
section (say the first) may be regarded as plane and is, at the - 
same time, very much shorter than the other, and secondly, 


when at least one of the sections is very long. 


In the first case, ie. for x; < D, a seemingly obvious general- 
ization of eqn. (55) gives 


Ver Vou 1x >| 


w(D) = v(D)| 1 —- 
Sy 

Closer examination shows, however, that eqn. (85) is valid only 
when Ws,(D —x) is approximately constant over the first 
section; this problem will be discussed in more detail in the 
subsequent analysis. 

In the second case we will assume the second section to be so 
long that the attenuation function ws,(D — x) over the first 


(85) 


may be calculated from eqn. (81); consequently the length of 


the second section (D — x,) must be at least that given by 


eqns. (80). The parameters referring to the first section will be 


denoted by the subscript 1, and those for the second section by 


the subscript 2. Similarly as in Section 5.1 we will calculate the 


contributions to the attenuation function, and to the field, 
originating from the part of the first section lying between the 


| points d; and d,. Assuming sy = 55, we obtain, by virtue of 


eqn. (7), 
ED d, ar 
waa = — ia) 20/51 ~V |, ae aT dx 
WD) uae — vay fs me 8 exp (—t)x)dx . (86) 
pand Ey 4g, = AW 
sD . 
me ay Porn. Vie vey | Mah Wail 2 exp (— t>x)dx . (87) 


According to eqn. (87) the part of the first section considered 
| may be replaced by an equivalent secondary source Pad» Placed 
. at the same point as the transmitting antenna, given by 


we x) (88) 


Pad, = exp (— t)x)dx 


Po ease i fap | Ma 
_ As the distances d; and d, are quite sie we may conclude 
that when the second section of a 2-section path is very long 
the first may always be replaced by one equivalent secondary 
source placed at the same point as the transmitting antenna. 
This property may be generalized in an obvious way for ail 
paths having very long last sections. In such cases all sections 
preceding the last one may be replaced by a suitable secondary 
: source placed at the beginning of the path. 

When the first section of a path may be regarded as approxi- 
mately plane we may replace w,,(x) by y(s,x). As in Section 3.3 
'we may expand exp (— f,x) in series and then obtain, as a 
‘ generalization of eqn. (33), 


es V $25 t8ti(s4dh) — v(s;d,)] (89) 


S 

Pdjd, = po MNS 

where € is the position of the ‘centre of gravity’ of the part of 

{the path considered; the secondary source itself is, however, 

placed as before at the beginning of the path. According to 

eqn. (89) the simple formula (85) may be applied only if e~ 21 ~ 1; 

iif this condition is not fulfilled the first section must be sub- 
divided, even if x; < D. 

We consider now the case when both sections of the path are 

\ very long, as in Fig. 16. When calculating the attenuation 


Fig. 16.—Path composed of two very long sections. 


function we first place the transmitter at the point A, assume 

Sy = S, and divide the first section into two parts. The first 

part we choose so long that for x > x{) the attenuation function 
w (x) may be found from eqn. (81). As w,,(D — x) also has 
the same form, we obtain, after easy reductions, 


“(DY = /(D)C, exp (1D) 


V5 — v/s2 
1) 


I 
{4a — Je CES (90) 


740 OF le xi} 


THEORY OF GROUND-WAVE PROPAGATION OVER AN INHOMOGENEOUS EARTH 


461 
“with 
(1) 
i aw (o3) 
Ay.=1- (/s vs) —l exp (— bx)dx 
1,2 ee seid p (— fx) 
G 
- 1_ exp [(t, — meh (91) 
hi ib 


If instead of sg = s2 we assume sp = 5, and place the transmitter 
at B, we obtain similarly 


w(D)"" = (4/D)C, exp (t,D) 
{4 a ed exp [(t, — t(D — x} . (92) 
with 


D— x) 


4p ee - Fas viol) a 


exp (— t,x)dx 


(93) 


2 exp [(t. — t(D — son} 
boils 
According to the reciprocity theorem (see Section 3.2) w(D)! = 
w(D)’". As the term A, is a function of x{! and A, , is a 
function of D — x}, they are independent of chick other; as a 
consequence of afte condition w(D)’ = w(D)” they both must be 
equal to zero. We thus obtain the interesting result that for a 
path composed of two very long sections the attenuation function 
is given by a simple formula 
VIALS 
w(D) = (WDCC Vsia Vs: ae oe 52 exp [tx + t(D — x) . (94) 
ty 

For further discussion we shall write eqn. (94) once more in a 

somewhat modified form: 


wD) = w,,(D) > exp [(t, — t))x,] . (95) 


oe 51 — V5) 
ae paat2 
According to eqn. (95) the Oe composed of two very long 


sections may be reduced to a homogeneous path of pea 
S = Sp, but instead of pp the primary source is then py , given by 


AWN oe owe Sy —V/s2 
ty 


96 
Pef = PoC a ( ) 


exp [(t, — t))x] 


Eqns. (85), (89), (88) and (95) describe the influence of the first 
section of the path on the field. The influence of short sections 
is the same as for a plane earth. When the length of the first 
section increases, there appear increasing deviations from plane- 
earth relations. For very long first sections eqns. (94) and 
(95) are valid, and, when considered as functions of x;, represent 
a logarithmic spiral winding around the point O. The influence 
of the first section of the path on the field may be plotted as a 
function of x, in the plane of complex variables. We thus 
obtain interesting curves of different shapes depending on the 
relations between the electrical parameters of both sections; 
Fig. 17 shows an example. 


(6.3) Multi-Section Paths 


First we will consider the case when all sections of the path, 
except possibly the first and the last, are very long. As pre- 
viously, x;_;, x; will denote the ends of the jth section. The 
parameters referring to the jth section will be given the subscript /j. 

When computing the field over the second section very far 
from the boundary with the first, we may, according to the dis- 
cussion in Section 6.2, replace the first section by some appro- 
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Fig. 17.—The influence of the first section of a very long path on the 
field: w(D)/ws2(D) = f(x1). 


land (€ = 4; o = 10-2mho/m). Second section: sea (€ = 80; 
Frequency: 1 Mc/s. Numbers denote the length of the first section 


First section: 
o = 4mhos/m). 
in kilometres. 


Correct result; — — According to eqn. (85); —:— According to eqn. (95). 


priate equivalent secondary source py. We may then consider 
the path as homogeneous of parameter s = 5, with the effective 
primary source pS? given by 


PY = py + Dy (97) 
For very short first sections eqn. (85) gives py directly : 
Ss, — “o/s 
py = Po | 4- — _—___ v i “/§ a afi oxo. (98) 
1 


For longer first sections p, must be calculated according to 
eqns. (89) and (88), and then p‘? according to eqn. (97). For 
very long first sections Dy is given by eqn. (96). 

When calculating the "field over the third section we may 
assume So = Ss, and evaluate the influence of secondary sources 
from the first and third. As mentioned before, the field of the 
primary source po and of the secondary sources from the first 
section may be reduced to the field of an effective primary 
source PS. The secondary sources over the third section, and 
consequently the field over that section, will thus be the same 
as in the case of a 2-section path with first section of length x, 
and parameter s = s2, the second of parameter s = 53, and 
primary source p\. 

When the third section is shorter than the lengths given by 
eqns. (80), the field may be calculated according to the 
reciprocity theorem by placing po at the end of the path and 
computing the field at its Genin: Again, the influence of 
the third section may be taken into account by replacing p‘? by 
an appropriate p>, which gives 
Py 


3) 
ELD) =~, WsAD) 


(99) 


and 


w(D) = n.(D) af = w,,(D) Pep Pa Pal ~ £ *@00) 
ef 
When the third section is very long we may use directly 


eqn. (95), which gives 
ai oe Sa A/ 53 
7 


bats 


d 
w(D) = w,,(D) Pes 
Po 


exp [(ty — t3)x2] . (101) 


It follows from eqn. (101) that the 3-section path considered 
has been thus reduced to a homogeneous path with parameter 
s§ = 53 and with a primary source Dv? given by 


V5. — V/'53 
Damas 


(De 
(2) Pef lo 


LEI OG Aire FA 


exp [(t — t3)x2] . (102) 
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From the above discussion it follows by induction that in | 
the general case also of a path composed of many very long | 


sections, except possibly the first and last, the field may be cal- 
culated as over a homogeneous earth, while the influence of 
subsequent sections of the path appears in the form of certain 
factors in the expression for the effective primary source. 


When the path consists of m sections, all very long except ’ 


possibly the first one, the path may be reduced to a homo- 
geneous earth s, with an effective primary source pl} ) given by 


pi-) = py Py ~ic ves /s3 —i Cie VS4 
2 Po Va? — 3 Va taverns 
a ox sees i ae Vn exp [tox + t3(x3 = Xp) + ARs 
VT ti aetn 
ar peg nes a2 bee) in ene | (103) 
The attenuation function over the path considered will therefore 
be 
1) Das 
ea Pes ove V'S3 
w(D) = w, (D = 4/D)G, 
(D) = ¥5,(D) V DG Cees 
ee V54 yg DG iia: 
ine Ui V1 tn-1 — 'n 
exp [tox2 + t3(x3 — X2) +... +4,(D —%n_1)] . (104) 


We will now consider a path which differs from the one just 
discussed only by the presence of a not very long (m + 1)th 
section. 
3-section paths [see eqns. (99) and (100)], and consequently the 
attenuation function for the path will differ from eqn. (104) 
only by a factor accounting for the presence of the (m + 1)th 
section. 


When all x sections of the path are very long we substitute in ~ 
eqn. (104) for p{P/py the formula (96) and obtain for w(D) a ~ 


simple expression, 


WD) = (/D)CseC aN Ti CV BANS 
Pal: es 1 Asn 
Soe ee 


exp [t)x, + (x2 — x1) +... + 4,(D — x,_)] . (105) 


For practical computations it is convenient to change the 


notation to that used in the homogeneous-spherical-earth theory — 


[see eqns. (79), (82), (83) and (84)]. We thus obtain 
(/D)C, = eM QrkD)'!?(ka)— (279, — 1/52)~! 


=i cM Si ee 1/8; — 1/8, 
/ 7 : t; — tr (27; = 1/3?)(79; = Tox) 


exp (7x) = exp [— ik(ka)~?3x9,x] exp [— k(ka)~7/3Bp,x] . (108) 


Eqn. (105), in the notation according to eqns. (106)-(108), has 
been obtained by Furutsu'*-!4 by another method. Eqn. (94) 
previously obtained represents a specialization of eqn. (105) for 
2-section paths. 

Finally, we will discuss the influence on the field of a com- 
paratively short section which is neither the first nor the last. 


. (106) 


The situation is quite similar to that discussed for - 


(107) © 


cues 


We will consider first a 3-section path with very long first and 


third sections and a comparatively short second section. From 


eqn. (7) the contribution of the second section to the attenuation © 


function for sg = s; amounts to 
-if2 (V/s. — veo 


*wOx)w.(D — 
VIR = Se 


Wrixs = 


. (109) 
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‘On the basis of the reciprocity theorem we may use eqn. (85) 
‘for w(x); after an obvious change of notation we obtain 


V/'82 = 
V'S2 


VSI 


w(x) & v4! + v[so(x xh ee CULO) 


The limits of application of eqn. (110) as well as its error should 
be clear from the discussion in Section 6.2. After substituting 


for w,,(D — x) from eqn. (81), eqn. (109) may be reduced to 
LAW, = w(DIC BV vs 53 exp [(t, — t3)€>] 
i co 
x {- Te es VV ale = x} CULE) 


where €, denotes the position of the ‘centre of gravity’ of the 
second section, and wu is the function determined by eqn. (45). 
Similarly, as in the case of the plane earth [see eqn. (44)], we 
may assume § = 4(x, + x). It follows from eqn. (111) that 
ha comparatively short second section may be replaced by the 
equivalent secondary source p>, placed at the beginning of the 
path, given by 


a V3 


exp [(t; — t3)&] 
V's4 


Ps = PoC 


x{-S. soy ny Tete 


This makes possible an easy calculation of the field over the 
tiurd section at points lying very far from the boundary (x — x) 
‘very large). We therefore assume sg = 53, replace the first 
and second sections by suitable equivalent sources p, and p>, 
and calculate the effective primary source p{*” given by 


ulso(x2 — 0} bila) 


DY? = po + Py + Po (113) 


The path may then be regarded as homogeneous with a para- 
meter s = 5s; and a primary source pit”. For source p we 
substitute in eqn. (113) from eqn. (112). To find the influence 
of the first section, we may calculate directly py + p, from eqn. 
(96), only changing the subscripts 2 into 3, i.e. 


s Ss 
Po + Pi = Pow ov ee > exp [(t, — t3)x1] . (114) 
Tae i 18 
The generalization of this procedure is very simple. In fact 


the only important assumptions have been that the section 
)considered is comparatively short; that the preceding and the 
\following sections are very long; and that we can replace the 
influence of all preceding sections by a single source of radiation 
placed at the beginning of the path. The calculation of the field 
\over the paths satisfying these conditions (e.g. those composed 
‘of successively very long and comparatively short sections) is 
‘carried out step by step by subsequent reduction of the path to 
‘a suitable homogeneous earth and the computation of the 
‘corresponding effective primary sources. 


(7) CONCLUSIONS 


The discussion presented in the paper is based on the integral 
eguation (7); its physical background is the interaction of 
secondary sources distributed over the surface of the earth. The 
problem is, physically and mathematically, complicated. Some 
simplification is possible through replacing the distributed 
secondary sources by equivalent secondary sources; for compli- 
c.ted mixed paths, however, this method also proves unsatis- 
factory. As mentioned in the Introduction, Furutsu and Wait 
heve performed rigorous calculations for some cases and have 


presented the results in the form of graphs. This method of 
attack seems impossible in the general case because of the large 
number of independent parameters. Even in the simplest case 
of plane 2-section paths the consideration of the complex 
character of numerical distances increases the number of para- 
meters by two, as compared with the set of graphs computed by 
Wait!® under the assumption of real numerical distances. In 
this respect the method of equivalent secondary sources is of 
advantage, since it is valid for every phase characteristic of the 
parameter s. Another advantage of this method is the clear 
physical picture it gives of the phenomena occurring; this may 
sometimes be a real help in practical computations. 

Summing up the discussion, we must regard the problem of 
practical mixed-path calculations as only partially solved so 
far and needing further investigation. In this respect an 
interesting suggestion has been made by Senior,!’ but further 
studies are necessary to decide whether his method will be suitable 
for practical computations. 

In view of the laboriousness of the theoretical computations, 
the so-called semi-empirical methods can be of great practical 
importance. In another paper the two most important of them, 
namely Millington’s method and the equivalent numerical- 
distance method, will be compared with theory in order to 
determine their errors and to find the limits of their application. 


(8) ACKNOWLEDGMENTS 


The author wishes to thank Mr. G. Millington for his many 
valuable remarks and very great and kind help in preparing the 
manuscript for publication. The author is indebted to Dr. T. 
Tomankiewicz for his friendly support during the course of the 
work and to the Director of the Institute of Telecommunication, 
Docent T. Rzymkowski, for his kind permission to publish the 
paper. The author is also grateful to Dr. J. R. Wait for his 
valuable remarks. 


(9) REFERENCES 


(1) GRUNBERG, G. A., and Fock, V. A.: ‘On the Theory of 
Coastal Refraction of Electromagnetic Waves’, from 
‘Investigations of Propagation of Radio Waves’ edited 
by B. A. Vvedensky (Academy of Sciences, U.S.S.R., 
Moscow, 1948), p. 69. 

(2) FEINBERG, E. L.: ‘Propagation of Radio Waves along a 
Real Surface’, ibid., p. 97. 

(3) ALPERT, J. L., GINzBURG, V. L., and FEINBERG, E. L.: 
‘Propagation of Radio Waves’ (GITTL, Moscow, 1953), 
Chapter IX, p. 184. 

(4) ‘“Ground-Wave Propagation over Mixed Paths—a Brief 
Review of the Present Situation’, C.C.I.R., Warsaw, 
1956, Document 501. 

(5) FemnBerG, E. L.: ‘Theory of Mixed Path Propagation of 
Radiowaves and Engineering Methods of Calculation’, 
ibid., Document 563. 

(6) CLEMMow, P. C.: ‘Ground-Wave Propagation across a 
Land/Sea Boundary’, Nature, 1950, 165, p. 107. 

(7) CLemmMow, P. C.: ‘Radio Propagation over a Flat Earth 
across a Boundary separating Two Different Media’, 
Philosophical Transactions of the Royal Society, A, 1953, 
246, p. 1. 

(8) ‘New Method of Calculating Ground-Wave Field Strength 
over Mixed Paths’, C.C.I.R., London, 1953, Docu- 
ment 141. 

(9) BReMMER, H.: ‘The Extension of Sommerfeld’s Formula 
for the Propagation of Radio Waves over a Flat Earth 
to Different Conductivities of the Soil’, Physica, 1954, 20, 
p. 441. 


464 GODZINSKI: USE OF EQUIVALENT SECONDARY SOURCES IN THEORY OF GROUND-WAVE PROPAGATION 


(10) Furutsu, K.: ‘Propagation of Electro-Magnetic Waves 
over a Flat Earth across a Boundary Separating Different 
Media and Coastal Refraction’, Journal of the Radio 
Research Laboratories (Tokyo), 1955, 2, p. 1. 

(11) Furutsu, K.: ‘Propagation of Electro-Magnetic Waves 
over a Flat Earth across Two Boundaries separating 
Three Different Media’, ibid., 1955, 2, p. 239. 

(12) Furutsu, K.: ‘Propagation of Electro-Magnetic Waves 
over the Spherical Earth across Boundaries separating 
Different Earth Media’, ibid., 1955, 2, p. 345. 

(13) Furutsu, K., and Komal, S.: ‘The Calculation of Field 
Strength over Mixed Paths on a Spherical Earth’, ibid., 
1956, 3, p. 391. 

(14) Furutsu, K.: ‘Ground-Wave Propagation over Mixed 
Paths—Calculation of Ground-Wave Field Strengths 
and Phases in Propagation over Mixed Paths’, C.C.LR., 
Warsaw, 1956, Document 322. 

(15) GopziNsk1, Z.: ‘Extension of Feinberg’s Theory to the 
Case of Electromagnetic Wave Propagation over an 
Inhomogeneous Spherical Earth and Introduction of an 
Approximate Method of Computation based on Equi- 
valent Secondary Sources’, ibid., Document 454. 

(16) Wart, J. R.: ‘Mixed Path Ground Wave Propagation: 1. 
Short Distances’, Journal of Research of the National 
Bureau of Standards, 1956, 57, p. 1. 

(17) Senor, T. B. A.: ‘Radio Propagation over a Discontinuity 
in the Earth’s- Electrical Properties—I’, Proceedings 
ILE.E., Monograph No. 192 R, August, 1956 (104C, 
p. 43). 

(18) SENror, T. B. A.: ‘Radio Propagation over a Discontinuity 
in the Earth’s Electrical Properties—II. Coastal Refrac- 
tion’, ibid., Monograph No. 210R, October, 1956 
(104 C, p. 139). 

(19) Warr, J. R.: ‘On the Theory of Propagation along a 
Curved Surface’, Transactions of the Institute of Radio 
Engineers, Vol. AP (to be published). 

(20) Bremmer, H.: ‘Applications of Operational Calculus to 
Ground-Wave Propagation, in particular for Long 


Waves’, Proceedings of the Institute of Radio Engineers 


(to be published). 


(21) Leontovicu, M. A.: ‘On the Approximate Boundary Con- | 


ditions for Electromagnetic Field on the Surface of Well- 
Conducting Bodies’, Reference 1, p. 5. 

(22) Leontovicn, M. A., and Fock, V. A.: ‘Solution of the 
Problem of Electromagnetic Wave Propagation along 
the Earth’s Surface by the Parabolic Equation Method’, 
Reference 1, p. 13. 

(23) Karpov, K. A.: ‘Tables of Function w(z) = Peg bl 4 e*dx 
in the Complex Plane’ (Academy of Sciences, U.S.S.R., 
Moscow, 1954). 

(24) Pressey, B. G., ASHWELL, G. E., and Fow er, C. S.: 
‘Change of Phase with Distance of a Low-Frequency 
Ground Wave propagated across a Coast-Line’, Pro- 


ceedings ILE.E., Paper No. 2082 R, July, 1956 (103 B, . 


p. 527). 

(25) Pressey, B. G., and ASHWELL, G. E.: ‘The Deviation of 
Low-Frequency Ground Waves at a Coast-Line’, ibid., 
Paper No. 2083 R, July, 1956 (103 B, p. 535). 

(26) Pressey, B. G., ASHWELL, G. E., and Fow ier, C. S.: ‘The 
Measurement of the Phase Velocity of Ground-Wave 


Propagation over a Land Path’, ibid., Paper No. 1438R, — 


March, 1953 (100, Part III, p. 73). 

(27) Norton, K. A.: ‘The Calculation of Ground-Wave Field 
Intensity over a Finitely Conducting Spherical Earth’, 
Proceedings of the Institute of Radio Engineers, 1941, 29, 
jos (O28), 


(28) BREMMER, H.: ‘Terrestrial Radio Waves’ (Elsevier Pub- . 


lishing Co., New York, 1949), p. 85. 
(29) ALPERT, J. L., GINZBURG, V. L., and FEINBERG, E. L.: 
Reference 3, p. 234. 


(30) Bremmer, H.: ‘The Evaluation of Ground-Wave Fields © 
for Short Distances’, C.C.I.R., London, 1953, Document ~ 


162. 

(31) MILLINGTON, G.: ‘Ground-Wave Propagation over an 
Inhomogeneous Smooth Earth’, Proceedings I.E.E., 
Paper No. 794 R, January, 1949 (96, Part III, p. 53). 


ae eS on ee ee 


pe EN EE EEE) ey ee oe 


( 621.371: 621.396.677.85 


The Institution of Electrical Engineers 
Monograph No. 300 R 
Apr. 1958 


© 


THE FIELDS ASSOCIATED WITH AN INTERFACE BETWEEN FREE SPACE AND AN 
ARTIFICIAL DIELECTRIC 
By J. BROWN, M.A., Ph.D., Associate Member, and J. S. SEELEY, B.Sc.(Eng.), Ph.D. 


(The paper was first received 20th December, 1957, and in revised form 28th January, 1958. It was published as an INSTITUTION MONOGRAPH 
in April, 1958.) 


SUMMARY 


_ In most respects artificial dielectrics behave in a similar way to 

' normal solid dielectrics. A complication is sometimes observed in 
that apparent phase discontinuities occur at the interface between 
free space and an artificial dielectric, such discontinuities arising from 
the excitation of reactive fields near the interface. It is shown that 
the behaviour can be analysed in terms of evanescent waves inside 
the dielectric and in free space, and detailed calculations are given for 

' the metal-strip type of delay dielectric. Measured values of the 
parameters of an equivalent circuit representing the free-space/artificial- 
dielectric interface are shown to agree with calculated values within 
the limits of experimental error. 


LIST OF PRINCIPAL SYMBOLS 


Ao = Free-space wavelength. 
A = Wavelength of a plane wave in a dielectric of 
permittivity e. 
a, c, d = Dimensions defined in Fig. 1. 
B, = Attenuation coefficient for nth mode. 
... = Incident-mode amplitudes. 
... = Reflected-mode amplitudes. 
t = Sin? (zd/2a). 
p = [1 —J(@/A) log, t]7!. 
Z,, Z2, = Mode wave impedances. 
L, M, N, B, T = 2 x 2 square matrices. 
V1, V2 = Mode voltages. 
ij, 7 = Mode currents. 
Y1, ¥2 = Propagation coefficients of modes in delay 
structure. 
a, Bj = Mode constants as defined in eqns. (24) 
and (25). 


Square matrices are indicated by upper-case bold-face type; 
column matrices by lower-case bold-face type. 


(1) INTRODUCTION 


In a previous discussion! of the properties of artificial dielec- 
trics it was pointed out that reactive fields are established near 
an interface between free space and the dielectric. These fields 
modify the reflection properties of the interface by introducing 
apparent phase discontinuities in the transmitted and reflected 
waves. The only example so far discussed in detail concerns 
the artificial dielectric formed by a stack of equi-spaced parallel 
conducting plates. An exact solution for the fields excited by 
a plane wave incident on this medium from free space has been 
provided by Carlson and Heins* and from this an equivalent 
circuit can be deduced to represent the behaviour of the propa- 
gating waves. In the present paper an approximate solution 
vill be obtained for the metal-strip dielectric,? which is one of 
the most useful artificial dielectrics for the construction of 
microwave lenses. 
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The metal-strip dielectric consists of a two-dimensional array 
of conducting strips positioned perpendicular to the electric field 
of the incident wave [Fig. 1(@)]. A good approximation to the 
properties of this material is obtained by representing each plane 
of strips by a shunt reactance, and to this order of approximation 
the reactive fields at the interface between the dielectric and free 
space can be neglected. If the successive planes of strips are 
close together, interaction occurs between the reactive fields 
associated with each plane and the equivalent circuit must be 
modified. A valid representation is a set of transmission lines 
corresponding to the plane wave and the possible evanescent 
waves which are coupled together at the positions of the planes 
of strips [Fig. 1(5)]. A general theory* for the behaviour of 
such a coupled system shows that it can be replaced by a con- 
tinuous region capable of supporting a number of independent 
modes, each typified by a propagation coefficient and a transverse 
field distribution in a similar way to the modes of a waveguide. 
These modes are excited at the interface between the material 
and free space. When only one mode can propagate freely, the 
effects of the remaining evanescent modes can be lumped together 
to give any of the possible representations discussed in 
Reference 1. Jn Section 2 the theory will be developed on the 
assumption that only the least attenuated of the evanescent 
modes need be considered, and this development will be used to 
give an equivalent circuit for the junction between free space and 
the artificial dielectric. 


(2) CALCULATION OF AN EQUIVALENT CIRCUIT FOR 
A FREE-SPACE/DIELECTRIC INTERFACE 

The calculation is divided into three stages: 

(a) The fields associated with one plane of strips are examined 
in detail, and particular attention is paid to the behaviour when 
an evanescent wave is incident on the strips owing to excitation 
from an adjacent plane. This represents a straightforward 
extension of the calculations used to determine the equivalent 
reactance of a plane of strips. 

(b) The results of (a) give the representation of the medium in 
the form of Fig. 1(5), and the second stage is to obtain from this 
the possible modes of propagation within the medium. This is 
effected by applying the results of Reference 4. 

(c) The final stage is the calculation of the reflection and 
transmission coefficients at the interface between free space and 
the medium, and leads directly to the desired equivalent circuit. 

To facilitate comparison with the experimental results 
described in Section 3, it is assumed that the metal strips are 
embedded in a dielectric of permittivity «. The dimensions of 
the strips are shown in Fig. 1. 


(2.1) The Fields associated with a Single Plane of Strips 


Conducting planes can be introduced in planes of symmetry 
to reduce the problem to that of a capacitive iris in a strip 
transmission line, Fig. 2(a). With the co-ordinate system shown, 
the only non-zero field components are £,, E, and H,. The 
field can be expanded as a set of modes, consisting of the TEM 
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| | (a) 


DIRECTION OF 
PROPAGATION 


bef eb 


Fig. 1.—Metal-strip dielectric. 


(a) Cross-section. 

(6) Equivalent circuit. 

The transmission lines 1,2,... N represent the possible waves in the dielectric support- 
ing the strips. At each plane of strips the waves are coupled together, this coupling 
being represented by the reactive networks X. 


(b) 


Fig. 2.—Parallel-plate line with capacitive iris in the plane z = 0. 


(a) Cross-section. 
(b) Equivalent circuit. 
Line 1 represents the TEM wave. 
Line 2 represents the least attenuated E mode. 


At the terminals A and B, the voltages and currents correspond to the transverse 
electric and magnetic fields immediately to the left and right respectively of the iris. 


mode and a set of E modes, all of which are evanescent provided 
that a is less than A/2, A being the wavelength of a plane wave in 
the dielectric base. The problem of calculating the equivalent 
reactance of the strip depends upon determining the field excited 
by an incident TEM mode. For the present analysis, a more 
general incident field, consisting of a plane wave plus the first 
E-mode, is assumed. This excites a reflected field and a trans- 
mitted field, so that 


B= Ge" + aje—°2 cos 27x/a) 
oe) 


+ 2 but? cos [2(n — 1)mxJ/a] whenz< 0 


ad (a, + bet Se (ay Sp by)e~ 822 COs (27x]a) 


+ me cos [27(n — 1)x/a] when z>0 (1) 


i 


where a;, a) are respectively the amplitudes of the incident 
TEM- and E-mode fields, 5, is the amplitude of the nth reflected 
mode, and 


ik = By = jalep)'? 
2 = A(n — 1)?2*Ja* — k?* 


(2) 


Dek. (3) 
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Eqn. (1) is written in such a way that the continuity of E,, across 
the plane z = 0, where the strip is located, is automatically 
satisfied. The component H, must be continuous in the gap 
between the strips, and this enables the amplitudes b, to be 
calculated in terms of the incident amplitudes a; and a). Under 
quasi-static conditions, i.e. if a is much less than A, the term ke 
can be neglected in eqn. (3), and the calculation of the values of 
b,, by, etc., can be made by a simple extension of the method 
used by Schwinger> for the capacitive iris. The details of the 
calculation are given in Reference 6, the result being 


ke » | l jp(l — tha/A a (4) 

bo} L2d—) a1 — p+? —1j}Le 

where t = sin? (zrd/2a) (5) 
p = [1 —JG@/A) log. t]~! (6) 


The values for the remaining amplitudes b, are not required. 

The matrix equation, eqn. (4), is now used to determine the 
form of the coupling networks in Fig. 1(6). The condition 
investigated in this Section is illustrated by Fig. 2(6), where 
transmission lines are used to represent the TEM and 
first E modes. 
jk and B, respectively and the impedances are taken as Z, and 
Z>, the wave impedances for the TEM and first E modes respec- 
tively. The voltages and currents, as defined in Fig. 2(6), are 
related by a reactance matrix, which can be used to give an 
equation‘ of the form 


vy vy 
LM 1 
Ona ieae V5 (7) 
: w A a ae 
ly Nae y 
iD) L 


where L, M, N are 2 < 2 square matrices and L is the transpose ~ 


of L. L is a real matrix and M, N are imaginary symmetric 
matrices. For the present problem, this equation can be simpli- 
fied, since the field component E, is continuous in the plane of 
the strips, implying that 


vy vi 
lig ® 
from which it follows that 
| Od Bes (9) 
M=0O. (10) 


I being the 2 x 2 unit matrix and O the 2 x 2 zero matrix. 
The normal procedure relating voltages and currents on trans- 


mission lines to the complex amplitudes of the incident and 
reflected waves gives 


Peli: 
Vy vj ue ay + ag (11) 
iy es 1/Z, 0 aq, — by 
be | 0 au i = a 
EI a oe 0 iG: + b, 
From eqns. (7)-(9), 
iy vy il 
=i /N) 
ear E. o 


The propagation coefficients for these lines are — 


eS ee eee eee 
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, and substitution from eqns. (11)-(13) gives 
Za = a S ne a 4] zee + by (15) 
a, — by ay + by ay + by 


‘where Z is the matrix Awe 
OnEZ > 


| Eqn. (1S) simplifies to 


[N +2Z-1] A age ne] ini ees) 
by a 


\ which, when compared with eqn. (4), shows that 


P+2N-1Z-t]-1 = —| eds! ip — tajr i 
2p(1 oe, t) 2j(alA\A —_ te + (2 25,5) 


(17) 


! from which N can be obtained. The matrix in eqn. (7) which 
‘ specifies the properties of the reactive coupling network is then 
( determined in terms of the strip dimensions and the operating 
‘ wavelength. 


(2.2) Possible Modes of Propagation in a Strip Dielectric 


When a number of transmission lines are coupled together at 
) regular intervals by identical networks, a set of independent 
| modes of propagation can be found equal in number to the 
| number of transmission lines. The voltages and currents at the 
| points midway between the coupling networks can be expressed 
; as a sum of these modes, each of which has a particular propaga- 
| tion coefficient and a definite relation between the voltages and 
( currents in the individual lines. This approach is an obvious 
( extension of the simple transmission-line analysis already referred 
| to, and represents a basic section of the medium of length c 
[Fig. 3(a)] as a set of independent lines [Fig. 3(5)]. From a 
_ general study of such coupled systems,‘ it can be shown that the 
' mode propagation coefficients are given by the latent roots of a 
matrix, which in the present problem becomes 


T =C?+ S* + SZNC eee ee ee (1S) 
(te, ae cos (kc/2) 0 (19) 
: ed iy | 0 eoch en jeer Teal 
j sin (ke/2) 0 
fin | 0 sinh aoe Soe 


The latent roots of J are obtained to a sufficient degree of 
approximation in Section 6.1, and, from them, the propagation 


_ coefficients of the two possible modes in the metal delay struc- 


ture* are given by 


cosh (y;c) = cos (ke) + (5) log. f sin | 


E ar “a =f)" sin (koje (21) 
cosh (y2c) = €2¢/21? Parrot; este!) LP fe 22) 


The first of these differs from the expression obtained from the 
simple transmission-line theory? only in the presence of the 
<econd factor. This factor tends rapidly to unity as c increases, 
confirming that the simple theory is valid when the separation 
“etween adjacent planes of strips is sufficiently large. Sub- 
stitution from eqn. (5) for ¢ and the use of the quasi-static 
“pproximation for 8, shows that this factor cannot exceed 
1 + (4a/A) exp (—27c/a)]. It is easily verified that this differs 


| ! | 
| 
=— C—e; 


} 
2 2 
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Fig. 3.—Metal-strip dielectric. 


(a) Basic section of equivalent circuit. The elements are those defined in Fig. 2(). 
(6) Representation of basic section by two independent lines corresponding to the 
two modes of propagation in the strip dielectric. 


from unity by less than 1% if a/A is less than 0-25 and c/a 
exceeds 0-75, conditions which are often satisfied in practice. 
Eqn. (21) will give more accurate values for the propagation 
coefficient y; when the strip dimensions do not satisfy these 
restrictions. 

In the applications of interest, the first-mode propagation 
coefficient, i.e. y;, is imaginary and cosh (y,c) is less than unity. 
The second mode is evanescent, and when approximations are 
made in eqn. (22) by neglecting the other evanescent modes it 
is found that 


4 
Yo = 2nla + ; log. (cosec ue) heed (23) 


Each of the two modes is associated with a particular transverse 
field distribution and this can be specified by the relative ampli- 
tudes of the waves on the two lines of Fig. 3(b). These relative 
amplitudes are obtained from the latent vectors of the matrix T 
and also give the transverse field distribution in the structure 
at points midway between adjacent planes of strips.4 These 
fields can be written 


Ee 04; + >; COS (27x]/ a) SA) 
Jol, — Bai aP Bo; COs (27rx]a) mae AE (25) 


where the suffix 7 indicates which mode is being considered. 

The calculation of the values of the constants associated with 
the two modes is carried out in Section 6.2. For the present 
purpose the general expressions can be simplified by assuming 
that the strip spacings are much less than the operating wave- 
length, and then 


Oo, = 401,11 — Nexp(—48.c) . . . . (6) 
oO aoe et) By” Gale en op en) 
ZB yy = kyo lk oh. So aes) 
Z, By. = — 2joea(a/A)1 — 1) exp (—48, 0) ae (05) 
ZB, = Zjoyskyc(1 — 1) exp (—48,0) Re el) 
DP engi Ve one eins. Ue a 


in which y, has been replaced by jk,. 

The remaining two constants, «,;,; and a5, can be regarded 
as the amplitudes of the first and second modes respectively in 
the artificial dielectric. They are determined in order to satisfy 
the continuity of the fields at the free-space/dielectric interface 
when a plane wave is incident from free space. 


(2.3) Behaviour at a Free-Space/Dielectric Interface 


The interface between free space and the dielectric is defined 
to be a distance c/2 from the first plane of strips. In this plane 
the fields excited within the dielectric consist of the two modes 
defined by letting i equal 1 and 2 in eqns. (24) and (25). The 
dielectric supporting the strips is also presumed to terminate at 
this plane, the discontinuity resulting from this being taken into 
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account by the choice of Z; and Z, as wave impedances within 
the dielectric. Suppose that the incident plane wave is given at 
the interface plane by 


E, =1; H, =1/Zo . (32) 


where Z, is the free-space plane-wave impedance. There is also 
a reflected plane wave of amplitude R,, and inspection of eqn. (24) 
shows that the continuity of E, across the interface plane requires 
a ‘reflected’ evanescent wave in the free-space region. If this 
has amplitude R>, the fields on the free-space side of the inter- 
face are 


(33) 
(34) 


E, =1 + R, + R, cos (27x/a) 
Hy, = (1 — Ry/Zo — (Ro/Zo2) cos 2rx/a) 


where Zo, is the wave impedance for the evanescent wave in free 


space. For the plane waves in free space and in the supporting 
dielectric 

ZolZ, = (€/€o)'? (35) 
and for the evanescent waves 

Zo2lZz = (€l€o)!? (36) 


to the degree of approximation being considered. 
Inspection of eqns. (24), (25), (33) and (34) shows that con- 
tinuity across the interface requires that’ 


1+ Ry = %1 1+ %2 (37) 
Rito} seen ee (38) 

1 — Ry = (€/eo)"7Z,(By, + Bia) (39) 
— Ry = (€/€0)"?Z4(Ba1 + B22) (40) 


’ Only the values of R; and «,;, corresponding respectively to the 
amplitudes of the reflected plane wave and the transmitted wave 
in the dielectric, are needed, and they can be found by eliminating 
the other unknowns from eqns. (26)-(31) and (37)-(40). The 
results are 


ko — ky 16jk% a cos* (md/2a) 
Fs q 2a 
i Ko tk, Ky tk)? AL+ (le) mca) (41) 
and ; 
2k 
Gols * Shek, 


8iky a cos* (zd/2a) 
f — iopcnig NM eiE NIG exp (—2nete (42) 


in which kp is the free-space plane-wave phase-change coefficient, 
and ¢ has been replaced by its value from eqn. (5). 

From these results the parameters of the equivalent circuit of 
Fig. 4 can be calculated. A convenient choice for the impedance 
of the line representing the incident plane wave is Z), and Z 


Fig. 4.—Equivalent circuit for free-space/artificial-dielectric 
interface. 


T, is a line of phase-constant ko and impedance Zp, representing the free-space 
plane wave. . 

T2 is a line of phase-constant k, and impedance Z representing the principal mode 
in the artificial dielectric. 

T2rminals A and B correspond to planes on the free-space and dielectric sides of 
the interface, respectively. 
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must then be calculated to give the correct magnitude of R,, as 
given by eqn. (41). To the order of approximation retained in 
the present analysis 

R, = |1 —nl/d +n) 


where n = k,/ko is the refractive index of the artificial dielectric. 

Hence, Z can be selected as Zp/n or nZy; the former is preferred 
as being identical to the value for a continuous dielectric of the 
same refractive index. 

No fresh information is yielded by considering the magnitude 
of «11, because conservation of power requires that the incident 
energy be shared between the reflected and transmitted waves. 
The amplitudes of these waves are therefore not independent. 

The extra lengths of line in Fig. 4 are selected to give the 
correct value for the phases of the reflected and transmitted 
waves, so that 


(43) 


I 


(44) 
(45) 


2koly pp == teh es Ry 


Kol, + ki == Tale Oia 


The 7 in eqn. (44) arises because Z is less than Zp (n exceeds 
unity), giving a phase change of 7 on reflection at the junction 
between the two lines in Fig. 4. The arguments of R, and «,; 
are both small, and to an adequate degree of approximation 


8a cos* (md]2a) exp (—27c]a) 
An? — 1)[1 + /eo)"7] 


(46) 


Kol, = 


(47) 


Two points of interest arise in this result. First, the lengths 
are independent of the wavelength, and secondly, since they are 
numerically equal but of opposite sign, the net result is equivalent 
to supposing that the dielectric extends into free space a distance 
l, beyond the physical interface. Kharadly’? has already sug- 
gested that this is a possible method of allowing for the reactive 
fields at the interface and has given experimental evidence of its 
validity. 


(3) EXPERIMENTAL VERIFICATION OF THEORETICAL 
RESULTS 


Detailed measurements on a strip dielectric have been carried 


out to verify the predictions of the theory given in the preceding 
Sections. Miles® has shown that there is a very close corre- 
spondence between free-space propagation at normal incidence 
on the type of structure considered and free-space propagation 
obliquely incident. His argument has been extended to show 
that experiments on artificial dielectrics of the strip type can be 
carried out in a rectangular waveguide, provided that all wave- 
lengths are interpreted as guide wavelengths.? The strip dielec- 
tric was therefore made from tin-foil 0-002in thick supported 


by expanded polystyrene of relative permittivity 1-035, the 


whole assembly being contained in waveguide No. 10. The 
dimensions of the strips [see Fig. 1(a)] were 


a=3:40cm 
c = 0:-508cm 
d =al2 =1:70cm 


The operating frequency was such that the wavelength in the 
empty guide was 15-S5cm (i.e. ky = 27/15-55) and in a guide 
filled with expanded polystyrene the wavelength was 15:00cm 
(kK = 2n/15-00). In the application of the theory, the effective 


relative permittivity of the supporting dielectric is therefore to 
be taken as 


e/éy = (15-55/15-00)2 = 1-075 (48) 
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(3.1) Measurement Technique 


Considerable difficulties arise in making measurements on a 
single interface, and a method was developed by which results 
could be obtained using sections of finite length. For each 
such section the magnitude of the refiection coefficient, | pl, and 
its phase, ub, referred to the input interface can be obtained very 
accurately by the Weissfloch or tangent method.!° The problem 
then reduces to finding a method of extracting the parameters 
of the equivalent circuit (Fig. 5) from the measured values of |p| 
and % taken for a series of different lengths, /. 


Ya ae 
= 
ae Is Bln, 


Fig. 5.—Equivalent circuit for section of artificial dielectric 
of length J, 


From Fig. 5 the impedance looking to the right at A is 
_ 74%o + JZ tan @ 


ZA Z+jZytan¢d ce. 

where b = kyl + 21) (50) 
The corresponding reflection coefficient is 

Zy—-Z,  j(Zée—Z) tang (51) 


bee ae eZ OZ) tan 


and the reflection coefficient at B, which corresponds to the 
free-space side of the interface, is 


5 2 Dh 
mee eo ep 
pa = Ile 2ZZy +j(Ze+Z tand 


a (Z3 — Z?) |tan 4] 
~ [42223 + (Ze + Z%)? tan? J]? 


from which it can be shown that 


exp (2jkol) (52) 


Hence |p| (53) 


2 Ip 4o_ 74) i, 
a= jap = (2 — z,) bind 
Zi 7b 
“3 a |sin [&,( + 21,)]] (54) 


It is assumed in eqns. (52) to (54) that Z is less than Zp. 
Eqn. (54) shows that if 2|p|/(1 — |p|)!/? is plotted against / a 
sinusoidal curve will result. Further, the zeros of this curve 
occur for 

(55) 


so that observations of the positions of two zeros will give both 
k, and/,, The maximum value of 2|p|/(1 — |p|?)!/? occurs when 
the sine function is unity and is equal to (Z§ — Z*)/ZZo, from 
which Z/Z, can be calculated. 

The experimental curve for the specimens tested is shown in 
Fig. 6, 2|p|/(1 — |p|*)"/* being plotted against the number of 
strips in the section. From this curve, using the above argument, 
it is found that 


7 7 
Tinie ees whence 1 1lko 


kL, (is 14, 1.€. l, => 0:023A,/n = 0:017A) 
(Z2 —Z)|ZZp = 0°70, ie. Z[Zy = 0°71. 


k,(7 + 21,) = a multiple of 7 


ky 


The only remaining parameter is /,, and to obtain this the 
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Fig. 6.—Curve plotted from measured values of reflection-coefficient 
magnitude for different numbers of strips in section. 


phase of the reflection coefficient must be considered. From 
eqn. (52), 


|p| cos ys = 
(ZZ —Z?) tan $[(ZF + Z7) tan cos (211) — 2ZZp sin (2Kol;)] 
[4Z2Z¢ + (Ze + Z2 tan? J]? 


(56) 


which, together with eqn. (53), leads after some algebraic mani- 
pulation to 


RCOSYE | as F Zp ZN? V2 
— |[p]2i2 E sin? (2K 9l,) + cos* (2K!) (> a e 
i _,(2ZZp tan 2kol, 
x | sin E + tan—! Ey (57) 


which shows that if 2 cos #/(1 — |p|*)"/? is plotted against the 
length of the section a sinusoidal curve again results. The 
zeros differ from those of the previous curve by the angle 


2 tan 2kol 

are tan aia and can therefore be used to determine /;. 
pene tec 
Tez, 


9 5 10 15 20 
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Fig. 7.—Curve plotted from measured values of reflection-coefficient 
magnitude and phase for different numbers of strips in section. 


The experimental curve is shown in Fig. 7, from which it is 
found that 
kol, = — 0-081 


The properties of the strip dielectric are now completely specified. 


i.e. 1, = — 0-013Ay 


(3.2) Comparison of Theoretical and Experimental Results 


The theoretical values for the parameters of the strip dielectric 
are easily obtained by substituting the values of the dimensions, 
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wavelength and effective relative permittivity of the supporting 
material in the appropriate equations of Section 2. The results 
are collected in Table 1. 


Table 1 


THEORETICAL AND MEASURED VALUES OF METAL-STRIP- 
DIELECTRIC PARAMETERS 


| 
| Estimated 
, i Measured : 
Quantity ee a earns 
Refractive index, 7 | Si, 1-38 anil 
Relative impedance, ZZ OS to Oe 74 ane 
Length, /; ae | —0-014A9 | —0-013A0 EMS) 
Length, /> ae | +0-014Ao 0:017A0 22S 
(ee re eS a SS ee eee 


In each case the difference between the experimental and 
theoretical values is within the limits of experimental error. 


(4) CONCLUSION 
It has been shown that the effects of reactive fields at the 
interface between free space and an artificial dielectric may be 
predicted by considering the excitation of evanescent modes. 


; ia’ 1 
exp (—jkc) + j sin (kc) E ain V2 


ECON) 


Reasonable approximations to the phase shifts which occur at 
the interface can be obtained by making allowance for the 
excitation of the first of these evanescent modes. The method 
described in the paper is general and can be applied in principle 
to any form of artificial dielectric. 


She = 
ag 
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(6) APPENDIX 


(6.1) Calculation of the Latent Roots of the Matrix T 


The matrix T is defined by eqn. (18), which can be rewritten 
thus 


= [C — SP + 2S[1 + 4ZN]c . (58) 


From eqn. (17) 
jp — 1(a/A) 


‘ | (59) 
2 +2jp(1 — 1)?(a/A) 


“=i 1, Wp) Pe 
[N+2Z—]-1N=I Ee 


t) 


1 
exp (— fyc) + B sinh (8c) 


The latent roots of T are the roots of the equation 


2ja(l — 2)? 
eRe |-4 


[exp (- Bo0) Be Z 5 Sinh (B,c) — é| 


{ exp (—jkc) + j sin (Kc) 2 


2a(1 — 1)? 
sh Att 


The analysis is based on the neglect of coupling by evanescent 
modes other than the least attenuated, and approximations to 
the same degree of accuracy can be made in calculating the 
latent roots. The values are 


sin (Kc) sinh (8,c) = 0 (64) 


= cos (ke) + 5 loge t sin (ko 


f He “a — t)? sin (ke) exp B20 | (65) 


E, = exp (B,c)/27? (66) 


From the general theory for coupled transmission lines, the 
propagation coefficients of the possible modes are given by 


fi 
b> 


cosh (y,c) = (67) 


cosh (yc) = (68) 


(6.2) Calculation of the Mode Distributions 


The fields corresponding to the two modes have the form given 
by eqns. (24) and (25). The elements [c;;, «2;] come from the 
latent vector of T corresponding to the root €i, so that 


ie, [NV 42Z1olIN = N—oZald 
~ P jp — tala) 
ae Ce Par a ee 
giving ; 
p jp — t)a/d) 
peat fens ae = 1) oP -E2jpQi= ee, oa 
1 a: 2ja — ft)? alls 
Aon D At? At2 
I+4ZN = da) 1 (62) 
es rs re 
Substitution into eqn. (58) gives 
2ja(l — 1)? 2aQl—t) . ske Bac 
| ye sin () cosh (5) am 
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exp (—jkc) +j[ 44 +78 - | sin (kc) — €, 


a) OOS Se Se ee 

a 2(555) (1 — asin ) cosh (2 =i 
eat 2 (55) (i — f) sin ey cosh ta) aA 
Loy) 


exp (—jke) + 4+ aa — | sin (kc) — &, 


To the required degree of approximation these equations 
become 


(71) 
(72) 


4 401, = t) COS (4ke) exp (—4B20) 


X31 = 2x5,(1 — £) sin Gkc) exp (—4f 0) 


The elements By which determine the current (magnetic- 
field) distributions are obtained*:° from the general relation 


ZB =SCaw (73) 


where W is a diagonal matrix with elements tanh Gy,c); s = 1, 2. 


Hence Zi B11 = 1 tan 4k,0)/tan Gkc) (74) 
Z1Bi2 = — jaz cot Gk) (75) 
Z>Po1 = jon, tan Gkyo) . (76) 
ZB = B22 - (77) 


to the same degree of approximation as before. 

A further simplification can be made when the spacings ak @ 
are much less than the wavelength, by replacing the trigonometric 
functions by their first approximations. This leads to the set of 
eqns. (26) to (31), which are accurate to the first powers of (a/A) 
and (c/A). 
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SUMMARY 


The reciprocity theorem which relates the transmission and reception 
properties of an aerial is extended to give information on the phase 
and amplitude of the signal received by the aerial for an incident plane 
wave of any polarization. The paper includes a rigorous proof based 
on the Lorentz reciprocity theorem for electromagnetic fields. 


LIST OF SYMBOLS 


r, 9, 6 = Spherical polar co-ordinates. 
6’, d’ = Polar angles defining direction of incoming plane 
wave. 
75 ¢ = Co-ordinates in feeder. 
k = Phase-change coefficient for free-space plane wave. 
B = Phase-change coefficient for mode in feeder. 
Yo) = Wave admittance for free-space plane wave. 
Y, = Wave admittance for mode in feeder. 
n, v, i,, p = Unit vectors. 
E = Electric field strength with suffixes as defined in text. 
H = Magnetic field strength with suffixes as defined in 
text. 
F, F’ = Radiation patterns. 
A, B = Complex amplitude constants. 
S,, S>, S; = Surfaces of integration. 
R = Radius of spherical surface $3. 
G = Power gain. 
Aeg = Effective receiving area. 
A = Wavelength of free-space plane wave. 


(1) INTRODUCTION 


The usual proof of the reciprocity theorem relating the trans- 
mission and reception properties of an aerial is based on circuit 
theory,!»2»3 and yields information only about signal powers. 
In some recent work by the author, it was necessary to extend 
the theorem to give information on the phases of the signals as 
well as their amplitudes, and this has led to a proof of the 
theorem based on electromagnetic field theory. Further, this 
proof considers only the fields radiated by the aerial under con- 
sideration, and the signal received when the aerial is illuminated 
by an incident plane wave. There is no need, as in the proof 
based on circuits, to introduce a second aerial. 

The situation considered is shown in Fig. 1. L is a feeder, 
which may be either a transmission line or a waveguide, con- 
nected to the aerial, which is contained in the region V. The 
precise type of aerial is immaterial and has no bearing on the 
analysis. 

A reference plane, P, is selected in the feeder and the phases of 
all the fields can be related to the phase at this plane. The only 
restriction on the position of P is that it should be sufficiently far 
from the aerial to ensure that the only field at P is that corre- 
sponding to the dominant mode in the feeder. 

When the aerial is used as a transmitter, power is supplied to 
the feeder and a radiation field is established outside the region 
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Fig. 1.—Diagram showing the physical arrangement discussed. 


The aerial can have any form and lies within the region V. It is fed by the trans- 
mission line or waveguide L, in which P is a plane to which the phases of the transmitted 
and received signals are referred. S 1, S2, S3 are surfaces over which integrations are 
carried out. 


of the aerial. The position of any point can be specified by 
spherical polar co-ordinates r, 0, ¢ referred to an origin, O, 
within the region occupied by the aerial. The transmitting 
radiation pattern of the aerial is then the function of 0 and ¢ 
which describes the variation of the radiated field for a large fixed 
value of the co-ordinate r. If the region V, occupied by the 
aerial, is of finite size, it is clear that in the limit, as r tends to 
infinity, the precise position of O is immaterial so long as it lies 
within V. 

When the aerial is used as a receiver, a plane wave is incident 
upon it and a signal is established within the feeder travelling 
away from the aerial. It will be assumed that the feeder is 
terminated in a matched load so that there is no wave in the 
feeder travelling towards the aerial. The plane wave will also 
be scattered by the aerial, giving rise to a reradiation field. The 
amplitude of the signal received in the feeder will depend on the 
direction of the incident plane wave, which can be specified by 
the spherical polar angles 0, fd. The functional dependence of 
the received amplitude on the angles 0 and ¢ is the receiving 
radiation pattern of the aerial. It will be shown that the trans- 
mitting and receiving radiation patterns are given by the same 
function of @ and ¢, in accordance with the circuit-theory version 
of the reciprocity theorem. Further, the proportionality con- 
stant between the patterns will be obtained in such a way that 
information on phases as well as on amplitudes can be obtained. 


(2) PROOF OF THE RECIPROCITY THEOREM 
(2.1) The Aerial as a Transmitter 
It is assumed that the feeder is matched to the aerial, so that 


[472] 


=e eee Eee 


SS 


Eee 


(there is no reflected wave in the feeder. The transmitting 
iradiation pattern is defined for unit power flow in the feeder, 
(to simplify later calculations of power gain. Expressions are 
\ | os for the fields in the feeder and the fields radiated by the 
; aerial. 


((2.1.1) The Fields in the Feeder. 


For any waveguide or transmission line, the transverse fields 
| for a single mode are given by 


Eé, UB g) | AE(é, n) exp (—jB 4) - : . (1) 


Je ACD XG =) 956) 152 9 rea ened ©) 
‘where €, 7 = Co-ordinates in a plane transverse to the axis of 
the feeder. 


€ = Co-ordinate in the direction of the feeder axis. 
In the present case € is measured from the 
reference plane P towards the aerial. Eqns. (1) 
and (2) therefore represent a wave travelling 
towards the aerial. 

n = Unit vector in the positive C-direction. 

E(€, 7) = Vector function of €, 7, defining the transverse 
electric field of the mode. E(€, 7) can always 
be selected to be a real function and this will 
be assumed here. 

B = Phase-change coefficient for the mode. 
Yy = Wave admittance for the mode. 
A = Complex coefficient giving the amplitude and 
phase of the wave at = 0, i.e. the reference 
plane P. 


Since all phases are being referred to the plane P, A can be 
assumed real without loss of generality. The amplitude of A 
is given by the condition that there is unit power flow towards 
the aerial, so that from Poynting’s theorem 


FAVS ister) dsay — le 2) 8, 3) 
where S, is the feeder cross-section. 


(2.1.2) The Radiated Field. 

The radiated field need be defined only for a very large value 
of the co-ordinate r and at any point appears as if it were a plane 
wave travelling in the positive r direction. The radiated field 
can therefore be expressed as 


E-aa(, 9, ¢) = Ae?) exp(—jkr) . . . ) 


DAT 0, p) = (Yo/r)i, x FO, p) exp (— jkr) . . (5) 


where Y) = Wave admittance of a plane wave in free space. 
F(@, 6) = Transmitting radiation pattern, expressed in vector 
form to define the direction as well as the magni- 
tude of the radiated field. 


and i, = Unit vector in the direction of increasing r. 


Since the radiation field behaves locally as a plane wave, both 
the electric and magnetic fields are perpendicular to the direction 
ofi,. This is satisfied by eqns. (4) and (5), provided that 


PMO tO 90) Tel) 20.019 86) 


The factor r which appears in the denominators of eqns. (4) 
and (5) ensures that the inverse-square law for power is satisfied. 


(2.2) The Aerial as a Receiver 
Let the direction of the incoming plane wave be specified by 
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the spherical polar angles (6’, ¢’). The most general expression 
for a plane wave from this direction is 


Einc&r, 9, &) = Ep exp { jkr[sin 0’ sin € cos (f’ — 4) 
+ cos 0’ cos ay} rex Gp, 


Hy GRONO) = "Yop Eas a See) 


where p is a unit vector in the direction of propagation defined 
by the angles (0’, ¢’) 

and £p is a constant vector giving the direction and magnitude 
of the electric field. Since the directions of the 
electric fields and of propagation are orthogonal for 
a plane wave, 


Eq..p —90 5 5 . ' 4 e (9) 


This incident field establishes a wave in the feeder travelling 
away from the aerial. From the results of the previous solution, 
this wave can be expressed as 


Ex(§, 7, 0) = BE(é, n) exp BD) . (10) 
HE, n, 2 = — Y,Bn x ECE, n) exp GiB8D) . (11) 


where B is a complex constant giving the amplitude and phase of 
the wave at the reference plane, P. The value of B 
will depend on the values of Ep, 6’ and ¢’. 


The incident field also causes reradiation by the aerial and the 
resulting fields can be expressed as: 


Era’, 9, $) = -F'O, $) exp (— jkr) (12) 


’ eas 
Ayaalr, 9, $) = > Yolr X F (0, $) exp (— jkr) (13) 


where F’(0, d) is the reradiation pattern and depends also on the 
values of Ep, 0’ and ¢’. FO, ¢) will not in 
general resemble F(6, ). 


(2.3) Application of the Lorentz Reciprocity Theorem 
A general theorem which applies to any electromagnetic 


fields in a source-free region bounded by a closed surface S, is* 


| (E, x Hy) — (Ey x Hy). vedS =0 (14) 
S 


where £,, H, and E,, H, = Two solutions for the electromag- 
netic fields within the region, 
v = Unit vector in the direction of the 
outward normal to S. 


This theorem will be applied to the present problem by 
taking E,, H, as the fields when the aerial transmits (Section 2.1) 
and E>, H> as the fields when a plane wave is incident on the 
aerial (Section 2.2). The surface S is shown in Fig. 1 and 
consists of three parts: 


S; = The cross-section of the feeder at the reference plane P. 
S, = The outer conducting surface of the feeder. 
S; = A sphere of radius R centred on 0. 


On $5, any electric field must be normal to the surface and 
any magnetic field must be tangential to it. All products of the 
type E x H must therefore vanish, so that there is no contribu- 
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tion to the integral in eqn. (14) from S. Substitution of the Further, in eqn. (23), the unit vector i, is defined at 6=06, 
appropriate values for Ej, etc., in eqn. (14) then gives dé = ¢’ and is therefore equal to —p, so that 
ly = — 2YoRexp (—jkKR)Ey. FO’, $1. (25) 


The double integral J can be evaluated directly by the method 
described by Silver,* giving 


ss ie {— ABY,E(é, n) x [ x E(é, n)] — ABYWEE, ») 


1 : 
x [n x ECE, |} .nd&dyn + lg e exp (— jkR)F(0, ¢) . 
1 = expGkRa j7)) aa ae 
KR 
The results for J, and J, can now be substituted in eqn. (15) and 
an expression obtained for the complex amplitude, B, of the 
= [En (R, 0, 6) + J exp (— jkR)F’(6, | wave in the feeder when the aerial acts as a receiver. This ? 
Leia: R expression 1S 


x [ Hin R, 0, 6) + a x F’(0, d) exp (— jieR) | 


YoA . / £ 
x [52 x F(6, $) exp (— jee |N,ae sin Odbdb =0 . (15) B= — exp (— ja/2)Eo- FO, $) . - @1) 
; It will be noticed that this result is independent of the radius R 
Since m and E(€, 7) are orthogonal for which the integrations were performed. The results for these 
E(é, n) x [n x E(é, 9)] = |E(E, 7)\2n . . (16) integrations are exact as the radius R tends to infinity. 
so that the first integral in eqn. (15) reduces to (3) DISCUSSION 
; Eqn. (27) can be regarded as an extended form of the aerial 
I, = 2AB vat 5 /E (€, m)/’dfdn . . . (1D reciprocity theorem and gives in a convenient form the behaviour 
of the aerial as a receiver in terms of its properties as a trans- 
Further, the integral has been evaluated in eqn. (3), so that mitter. The extension over previous statements of the reciprocity 
_ theorem is twofold: 
hime tid pon tiss ws > 18) (a) the phase ofthe signal received by the aerial feeder is 
In the second integral, both F(@, 4) and F’ (6, ¢) are normal to determined as well as its amplitude, and A 
i, and from this it follows that the terms involving F’(6, ¢) (b) there are no restrictions on the direction of polarization of — 
cancel, leaving the plane wave incident upon the aerial. 
The derivation of the result is completely rigorous and full 
I, = Rexp (— JRR)| {F(, hb) X Hind R, 9, 4) account is taken of the reradiation from the aerial when it acts 
Jss as a receiver. The extension of the result to incident fields — 


P7YeE AR 0. o)) [Lo EO, ¢)}} .i, sin 0d8db == (19) which are not plane waves has already been discussed.® : 
A convenient way of expressing the amplitude reciprocity — 
The final term also simplifies since F(6, ¢) is normal to i,, giving relation is to state the effective receiving area of the aerial in 
terms of its power gain. It will be shown that eqn. (27) gives the 
I, = Rexp (— JkR)| {[F (0, b) X Hinc(R, 9, $)] «i, same result as obtained by other methods. 
3 The power gain of the aerial at any point is the ratio of the 
— YoEn&R, 8, p)'. FO, $)} sin d6db . (20) power flux per unit area at that point when the aerial is radiating 
to the power flux per unit area at the same point when the same 


Substitution from eqns. (7) and (8) gives total power is uniformly radiated in all directions. It follows 


Qn 7/2 from the definition of the transmitting radiation pattern in 

I, = YoR exp (— ikR) | | UF (0, $) x (p x E))] -i, Section ee the power gain for the direction defined by the 
yu) angles (6, ~) is 

— E,. F(@, ¢)}x exp {jkR [sin 0’ sin 6 cos (d’ — ¢) GO,) = Yo|F@, )|? Sa = AS} 

Fk BRP ay x sin Odbdb . (21) The effective receiving area of the aerial when a plane wave 


is incident from the direction (6, $) is the ratio of the power 
In the integrand the argument of the exponential varies rapidly delivered to the feeder to the power flux per unit area in the 
with @ and ¢ and the method of stationary phase> can be used, incident wave. Hence, from Section 2.2, 
This gives an asymptotic expansion for the integral, and in the 


limit, as R tends to infinity, only the first term of this expansion A.g(@, ¢) = AB)? YoS fo EE, Pasa (29) 
is needed. The stationary-phase point occurs at $Yo|Eo|? 
rend ae ee ae Se (22) When eqns. (27) and (3) are used, this becomes 
that eqn. (21 Yok)" [Eo - FG, ¢)P 
so that eqn. (21) becomes Ag (0, $) = (7 Yo Abo ( )] (30) 
In = YoR exp (— jkR) Y|Eo| 


eae : It is obvious that the receivin h i 
FO, . wep a sae Abs e: g area has a maximum value when 
{[ (1,6) x (p oO]. — Ey. FO’, PP - (23) Eo is parallel to F(@, 4), a result to be expected on physical 


where grounds. For this case 


2r ~1/2 ae 2 
I 2) i exp {ikKR [sin 6” sin 6 cos (6’ — d) pi Sr 
9 °—n/2 where A is the free-space wavelength, equal to 2n/k. 
* Referenced, ps121. 


+ cos &’ cos 6] sin Odbdb . (24) 
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Finally, if |F(@, ¢)|? is eliminated from eqns. (28) and (31), 


there results 
Ag, b) = GO, $)/4z 


a well-known formula. 


(32) 
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SUMMARY 


The logical treatment of the unified-machine theory is of such 
merit that it seems likely eventually to supersede the classical treatment 
entirely. The unified theory is, however, at present deficient in two 
respects. It requires two primitive machines, a slip-ring primitive 
and a commutator primitive, as the bases for analysis, and it tacitly 
assumes the existence of a perfect commutator. The mechanism of 
commutation is entirely neglected. 

The paper gives a new theory of commutation, which not only 
elucidates this mechanism but enables the performance of any machine 
or machine system to be developed from one basis only, namely the 
simplest case of two coils with relative angular movement. : 

Each stage of the analysis is supported with a comprehensive series 
of experimental results. The tests involved introduce some interesting 
problems, and a new method for the measurement of the coefficient 
of self-inductance to direct currents is developed. 


(1) INTRODUCTION 


The great merit possessed by the unified-machine theory is 
that of a continuous logical development from first principles to 
complex machine systems. To make clear the object of the 
paper, a brief recapitulation of this development is necessary. 

The simplest form of electrical machine is a solenoid or 
electromagnet, in which the inductance varies with the position 
of the armature, and hence with time. The voltage equation is 

dl dL 


= ! 
V RI+ LT + IF : 


and the power equation becomes 
d(4LI?) aL 
Pehle fe 
Sa ag 
The final term in this equation denotes the electrical power 
converted into mechanical power by the movement of the 
armature; it must therefore equal Fdx/dt, where F is the force 
upon, and x the displacement of, the armature. It follows that 
dL 
) ey ley (AEE 
dx 
Alternatively, the variation of L may be caused by a variation 
in angular position 6, so that, similarly, if T is the torque, 
dL 
Pete” po ee oe ee 
do 2) 
Eqns. (1) and (2) are the voltage and torque equations of the 
simplest machine. 
For two or more coils, the concept of mutual inductance is 
required. For two coils with mutual inductance the voltage 
equations are 


(1) 


ALI) p (MoI) 


Ve= Rh oF a 
d . . . (3) 
(Lolz) , d(My1,) 
Vy ae Roh ag dt I dt 
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and consideration of the power equation gives for the torque 


dL dL dM. 
Qo pets ee 
170 27 + hb i ee (4) 


In obtaining the torque equation, the assumption has been made 
that M,, = M),. This equality is well established for linear 
(air-cored) inductors, and is capable of theoretical proof from 
energy considerations. For saturable iron-cored inductors, 
however, it is considered that the equality can only be established 
by experiment. A wide range of experiments by the author has 
shown that, provided the saturation conditions are carefully 
reproduced, the equality is always confirmed. It is therefore 
assumed throughout the paper, and there results a certain sym- 
metry in the voltage equations, which is seen later to be of great 
importance. 

Eqns. (3) and (4) are adequate for the analysis of all electrical 
machines not fitted with commutators. Such ‘slip-ring’ machines 
include not only alternators and wound-rotor induction motors, 


ip ey 


tH 


but also machines without any connections to the rotor, such as | 


squirrel-cage motors and inductor alternators. Commutator 
machines require a different method of analysis, which introduces 
fundamental difficulties. These difficulties will be investigated 
in detail later. 

Although the equations of any slip-ring machine may be 
written down immediately by comparison with eqns. (3) and (4), 
yet the solution of these differential equations for any particular 
type of machine has constituted historically the major problem 
of machine analysis. Jn the case of the alternator, the-problem 
was finally resolved by Park,! after valuable preliminary work 
by Blondel”: 3 and by Doherty and Nickle.4 The corresponding 
induction-motor problem was solved by Fortescue.» The 
equations are to-day most satisfactorily treated by the methods 
of tensor analysis, which were first applied to machine problems 
by Kron. The subject has been given an exhaustive analysis 
by Kron himself,® 7 ® and also by Gibbs,? Lynn!° and others. 

In matrix form the basic equations given above may be 
written 


V = RI+ pL (3a) 


dL 
ik ll 
4 t de (4a) 
where the voltage and current matrices V and I have as many 
rows as there are different windings on the machine. The 
symbol J, denotes the transpose of the current matrix, while 
R and L are the resistance and inductance matrices, R being 


diagonal and L symmetric. The operator p (= d/dt) acts on the 
product of Z and J. 


(2) THE SLIP-RING PRIMITIVE MACHINE 
Kron’s slip-ring primitive machine consists, as shown in 
Fig. 1, of two stator windings and two independent rotor 
windings, whose ends are connected to pairs of slip rings. 
Eqn. (3) is readily extended to the case of four windings, and the 
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eee equation of the machine is given in its most general have been given elsewhere, it is given in full in Section 11. The 
orm by transformed voltage equation has the form 


Via Ze 


where 


(5) 


For further progress a knowledge of the variation of the 
inductance coefficients with @ is necessary. 

For alternators and induction motors, the following assump- 
tions may legitimately be made: 


(a) that the stator windings are in electrical space quadrature; 

(6) that the rotor windings are in electrical space quadrature; 

(c) that the rotor windings are similar; 

(d) that the rotor has a smooth cylindrical surface; and 

(e) that space harmonics higher than the second may be 
neglected. 


The impedance matrix then assumes the more concrete form of 


eqn. (6),% 78:9, 19 thus (8) 


—Mypsin 0 


R, + Lyp Mpp cos 0 


ee ee ee ) 
Mp cos 8 R; + Lap — L,p cos 20 —Lpp sin 20 


Mop sin 0 —Ly,p sin 20 R; + L,p + Lyp cos 260 


Here the operator p acts in general on products of functions of 8 
and time-varying currents. 

The difficulties of solution of the above equations are primarily 
due to the fact that not only are the inductance coefficients h, 
non-linear functions of the currents, owing to saturation, but 
the saturation of the rotor windings also varies with the \ m 
rotor position. Although the first stages in the solution of this awe = 
alternator problem were developed by Blondel from physical ‘a 
considerations, it is possible to consider it as the purely mathe- 2 
matical problem of finding a passive transformation which will — OR 
eliminate 6 from the impedance matrix. The required trans- —_ 
formation is \ 


Fig. 1.—Slip-ring primitive machine. 


(9) 


From the tensor point of view, this transformation is aon- 
holonomic, and the use of advanced concepts of analytical 
dynamics and tensor analysis is required for its successful 
application. The correct result may, however, be readily 
eotained by direct matrix methods. This matrix analysis is 
considered to be important, and since it does not appear to where w = d0/dt, Lg = L, + L, and L, = L, — Ly. 
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The torque is given by the expression 


T=fGr (10) 
where G’ is the coefficient of w in Z’. 

Certain points of great importance arise in connection with 
eqns. (8), (9) and (10). 

First, it is clear that the transformation has in fact resulted 
in the elimination of @ from the alternator equations. Not only 
does it now become possible to solve the equations by normal 
methods, but, equally important, saturation curves of Lg and 
L, may be determined uniquely by experiment, since the values 
of these quantities are no longer influenced by rotor position. 
The new equations, unlike eqn. (6), are therefore of direct 
practical application. Whilst no great attention is paid to the 
point here, it must be stressed that such phenomena as saturation 
and iron losses can be treated at least as easily with the unified 
theory as with classical methods. This is essential, because a 
theory which neglected these phenomena, however mathemati- 
cally elegant it may be, would be of little practical value. 

Secondly, the transformed impedance matrix is no longer 
symmetric. Whilst the self-inductances on the leading diagonal 
and the mutual inductances on the trailing diagonal are sym- 
metric, the rotational inductances, which appear on the rotor 
rows only, are not. The original impedance matrix of the 
machine [eqn. (6)] was derived from physical considerations 
and is symmetric. The asymmetric matrix [eqn. (9)] followed 
as the result of a mathematical transformation. It need not be 
symmetric, since it no longer has a direct physical significance. 

Thirdly, the torque equation (10) now has a different form 
from that of the original equation (4a). 

The second and third points arise again in the next Section, 
and will be discussed fully later. 


(3) THE COMMUTATOR PRIMITIVE MACHINE 


The derivation of the equations of the slip-ring machine 
from first principles, considered in the previous Section, was 
perfectly straightforward. This is not the case with the primitive 
commutator machine to be considered now. 


z 
| 


4a 


Fig. 2.—Simplest form of commutator machine. 


If the simplest form of commutator machine of Fig. 2 be 
considered, it is possible, by means of a series of simple experi- 
ments at the field and armature terminals and at the shaft, to 
establish the familiar equations 


Vp Rela Leply 
V, = Roly + Lapl, + M'wl; ~ 
Lo MII, (12) 


Consideration of the armature voltage equation shows that 
the units of the product M’w are ohms or henrys per second. 
Since w is the angular velocity of the rotor in electrical radians 
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per second it follows that the units of M’ must be henrys per 
electrical radian. 
of the nature of a space rate of change of inductance. Such 
inductance quantities are of frequent occurrence and will be 
referred to as rotational inductance coefficients. 

If the cross-field machine in Fig. 3, which has two stator and 


e 


| 


q 2 
Sak: 


d 
Fig. 3.—Commutator primitive machine. 
two rotor windings, is investigated in a similar manner, its 


equations, conveniently written in matrix form, are found to 
be as follows: 


R, + Lop 


Mp 


wM, 


(13) 


If the two rotor windings are similar, this impedance matrix 
becomes identical in form with that of eqn. (9). This result is 
made even more remarkable by the fact that if the rotor of the 
slip-ring primitive machine (Section 2) carries balanced 2-phase 
currents, the transformed currents and voltages of eqn. (8) 
become direct currents and voltages. It follows that the p 
terms disappear from eqn. (9) and that the transformed equations 
of the alternator become identical with those of a d.c. com- 
mutator machine. The discovery of such mathematical analogies 
between the equations of types of machines that are physically 
quite distinct forms one of the main attractions of Kron’s 
unified-machine theory. 

Unfortunately, the existence of this remarkable analogy does 
not eliminate the severe difficulties presented by the equations 
of the commutator machine, as they appear above. These 
difficulties may be specified as follows: 

(a) The equations of the machine are not symmetric. It is 
true that the equations of the transformed slip-ring machine 
were not symmetric. But in that case the initial equations, 
which represented directly the physical performance of the 
machine, were symmetric, and the asymmetry only arose as the 
result of a mathematical operation. Since the mathematical 
process was correct, the result must be correct and no difficulty 
arises. The commutator-machine equations, on the other hand, 
were derived as the result of experiments at the machine ter- 
minals, and must therefore be looked upon as giving a true 
physical picture of the performance of the machine. But the 
equations of any linear static network and any slip-ring machine 
are known to be symmetric, and in nature, in general, action 
and reaction are equal and opposite. The precise mechanism 


It will be confirmed later that M’ is in fact | 
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whereby the asymmetry of the commutator machine arises may 
therefore properly be looked upon as a difficulty that requires 
further investigation. 
(b) The equations of the machine give no explicit evidence 
whatsoever of the existence of any commutator, nor of any 
interpole windings. From design and frequently from opera- 
tional considerations, as well as in the classical theory of d.c. 
machines, the study of commutation plays a fundamental role. 
It follows that the complete disregard of this vital phenomenon 
may properly be considered as a difficulty in eqns. (11)-(13). 
(c) The equations of the machine ignore completely the 
presence of two windings, the short-circuited turns and the 
‘interpoles. These windings introduce two phenomena at least 
which, it would seem, cannot be neglected without justification. 
First, in the simple commutator machine in Fig. 2 there is a 
fixed position in space at which the current in the armature 
windings is continually being reversed. Since this position is 
on the direct axis of the field, it might be anticipated that a 
very high voltage would be induced thereby in the field winding. 
This voltage was measured for a particular machine and found 
to be ten times the rated voltage of the armature. Secondly, 
since the output voltage of the machine is produced by the 
rotation of the armature winding through the flux of the field 
winding, it is to be expected that a much higher voltage would 
be generated in the field by the rotation past it of the flux of 
-armature reaction. Measurements of this voltage show that it 
-aiso has a very high value. The apparent neglect of such 
important phenomena constitutes a third difficulty. 
The point is not that the equations above are incorrect; they 
are in fact derived from the results of direct experiments. It is 
simply that the terminal quantities are only the end-product of 
_a complex process, and whilst it may be legitimate to utilize 
this end-product for further analysis, a study of the underlying 

processes is essential if the equations are to be convincing and 
if the unified theory is to be considered wholly satisfactory. This 
study is carried out in the remaining part of the paper. 


(4) DERIVATION OF THE COMMUTATOR MACHINE WITH 
INTERPOLES 

Further consideration of the simple commutator machine in 
Fig. 2 shows that it has in effect four distinct windings, of which 
only two appear in the diagram and in eqn. (11). There are in 
fact two stator windings set in electrical space quadrature, 
namely the field and the interpoles, and there are two rotor 
windings also set in electrical space quadrature, namely the 
main armature winding and the short-circuited turns. Although 
these two rotor windings are actually part of one closed sym- 
metrical armature winding, yet they must be treated separately, 
because the action of the commutator is such that the currents 
in the short-circuited turns are different from those in the main 
armature. 

The following analysis begins with the 4-winding slip-ring 
primitive machine shown diagrammatically in Fig. 1. The 
intention is to convert this machine into the (nominally) 2- 
winding commutator machine in Fig. 2 by converting one stator 
winding (1) into the field winding, the other (2) into the interpole 
winding, one rotor winding (3) into the main armature winding 
and the other (4) into the short-circuited armature turns. The 
conversion, however, is to be a physical and not a mathematical 
ene. 

The general equations of the machine in Fig. 1, which no 
tonger has similar rotor windings, are as given in eqn. (5). The 
*orque is given by eqn. (4a). The only simplifying assumption 
tnat it is legitimate to make is that the rotor is cylindrical, from 
which it follows that the self-inductances of the stator windings 
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are independent of the angular position of the rotor, and their 
mutual inductance is zero. With these exceptions, the inductance 
coefficients in the impedance matrix are all functions of rotor 
position. These coefficients have been measured for a particular 
machine and are as shown in Fig. 4. The curves, which are 
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Fig. 4.—Variation with rotor position of the coefficients of self- and 
mutual inductance. 


discussed in detail later, are considered to be typical. Normal 
assumptions about the symmetries of the iron circuit and 
windings enable the general form of the inductance coefficients 
to be expressed as Fourier series of @ as follows: 

L, and L, are independent of 


oo > 
L, = Ly + >) £5,.c0s Cx!) 
n=1 
Mi == 0 


My3 = X Miaqsin [Qn — 196] 
n=1 


s 
| 


= y My4n cos [(2n — 16] 
n=1 
(14) 


> 
| 


= L49 — SS I hope cos (2n8) 
n=1 
M3 = bs M3n cos [(2n = 1)6] 
n=1 


My, = Y Mog, sin [Qn — 16] 
1 


a 


ao 
M34 = yy M34n sin (2n8) 
1 


i= 
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The voltage generated in the windings of a normal d.c. com- 
mutator machine is alternating, and the purpose of the com- 
mutator is to reverse the connections to the coils at the instant 
when the voltage generated in them changes sign. As far as 
currents in the armature are concerned, reversals occur twice in 
a cycle and the time of reversal is relatively short. As far as 
position in space is concerned, however, the process of current 
reversal is continuously repetitive. Whilst the current in a 
particular coil reverses but seldom, there is always some coil in 
which a reversal is taking place. From this point of view the 
time of a cycle of operations is clearly the time taken by the 
rotor to move through the thickness of a brush. (This is on the 
assumption, for the moment, that the thickness of a brush is 
equal to the thickness of one commutator segment pitch.) 

During the time of commutation, the connections to the 
armature of the machine are not changed, so that during this 
short interval the machine must behave in exactly the same 
manner as a slip-ring machine. At the end of the interval the 
connections are suddenly changed, and an identical cycle of 
events begins. The point of the argument, therefore, is that the 
behaviour of the commutator primitive machine may be deter- 
mined completely simply by analysing the behaviour of an 
appropriate slip-ring primitive machine whose equations are 
known, during the time of commutation of one coil. 

This method of analysis overcomes the difficulties referred to 
in Section 3; for the impedance matrix is symmetric, the inter- 
poles and short-circuited turns are taken into account, and all 
the induced voltages in the machine are considered. 

Let the interval of commutation be 6. Then, since the axis 
of the coils being commutated is the direct axis of the field, it 
follows that the limits of @ during the time of commutation will 
be from —46 to +46, 27/5 being the number of commutator 
segments per pair of poles. In practice, the value of 5 is small, 
and it is a reasonable assumption, so far as the values of the 
inductance coefficients given in eqn. (14) are concerned, that 
sin 20 = sin 6 = 0 and that cos6 = 1. When these assump- 
tions are made, @ disappears explicitly from the impedance 
matrix and an appreciable simplification occurs. The general 
term in the voltage equations p(MJ) may be expanded thus 


DMD = MpI + wM'I 


where M’ = dM/d@ (a rotational inductance coefficient) and 
w is written for d@/dt, so that wherever p appears in the impedance 
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Fig. 5.—Diagram showing the double action of brushes in providing 4 


an external connection to the armature (3) and a short-circuit 
path for the coils undergoing commutation (4). : 


set. The brushes, in fact, may be considered as having two 
purposes; the first, corresponding to (3), of supplying current 
to the armature, and the second, corresponding to (4), of 
reversing or short-circuiting the armature current. 
noted that the method of deriving the matrix is such that it 
remains symmetric. 

In practice, the armature and interpoles are connected in 
series so that their magnetomotive forces oppose. This con- 
nection may be accomplished by means of an active trans- 
formation of standard type,”> ®° thus 


Z = CZC 


where Z is the old and Z’ the new impedance matrix. C is the i 
connection matrix given in this case by the relationship 


V; wM. 13 My4p 
Vag | = (Ry + R3) + Ly — 2M3 + L3)p| —w(My, — M34) (16) 
V4 —w(My4 — M3,) 


matrix it operates upon current only. Using eqns. (14) with the 


above assumptions, the impedance matrix of eqn. (5) becomes 


(15) 


This impedance matrix applies to the machine in Fig. 5, 
wherein the two sets of brushes shown are in reality the same 


where V3 is the voltage applied to the armature and interpoles 
in series, and J,3 is their common current. 


The impedance matrix is naturally still symmetric. 


(5) ELIMINATION OF THE SHORT-CIRCUITED TURNS 


From one point of view the voltage equation just given is 
final. It gives the relation between the voltages and currents in _ 
the four different windings of the machine over a particular 
interval of time representative of the complete operation. On 
the other hand, it is a basic principle of the tensor analysis of 
the performance of machines that there can be only as many sets 
of equations as there are sets of terminals on the machine. 
Since the commutator machine considered (Fig. 2) has terminals 


It will be 5 


only for the field and for the armature, it is necessary before 
‘he equations can be regarded as wholly satisfactory to eliminate 
rom the impedance matrix the row and column corresponding 
To) the short-circuited turns, since there are no external con- 
nections to these turns. There is a standard procedure for the 
rlimination of unwanted rows and columns of a matrix,” ® but 
owing to the unusual nature of the present problem it is more 
satisfactory to use a different method. 

First of all, let eqn. (16) be expanded out of its matrix form. 
his gives 


Vi = (Ry + Lypyl, + Mi 3h3 + Mysply (17) 

Wo3 = wMy3I, + [CR + R3) + (Ly — 2Mp3 + L3)p|3 
— w(My4 — My4)Iy (18) 
Va = Mysply — o(My4 — Mya)o3 + (Ry + Lap. (19) 


iit is now proposed to consider these equations one by one. 


(5.1) The Short-Circuited-Turns Equation 
The voltage equation of the short-circuited turns just given is 


V4 = Mygpl, — w(My, — M3a)l3 + (Ry + Lap, 


ach term of this equation will be studied in turn. The first 
erm, V4, represents the voltage applied to the short-circuited 
ums from an external source, and is clearly zero. The second 
erm, M,,pl,, represents physically the voltage that would be 
induced in the turns by simple mutual induction if the field 
current was changing. For a d.c. machine, the field current 
ay be assumed to be constant so this term will disappear. 
or an a.c. commutator machine, however, the term will be 
portant unless the interval of commutation is very small 
compared with the period of the supply. For the moment, a d.c. 
achine will be assumed and this term may therefore be 
meglected. 
Eqn. (17) thus reduces to 

(Ry + Lyp)l, — w(Ma4 — M34)3 = 0. (20) 
[The significance of the components of these terms is as follows. 
i\R4l, is the resistive voltage drop in the turns; the value of R, 
kdepends not only upon the resistance of the conductors, but 
also upon the resistance of the brushes which complete the 
‘short-circuit, and upon their contact resistance. Further- 
more, this latter resistance is known not to be constant. 
The term L4plI, is the voltage due to the self-inductance of the 
turns, and wM3,4/; is the voltage that would be generated in the 
turns by their rotation in the field of the armature (armature 
reaction) if the interpoles were omitted or unexcited. Finally, 
since Ih, = — l, = + 13, the term —wM>,4l,3 is the voltage 
that would be generated in the turns by their rotation in the 
field of the interpoles, if the armature were unexcited. MM), is 
not normally constant during the interval of commutation, since 
the interpole shoes are in general bevelled to improve com- 
mutation. 

If the interpole winding were arbitrary, the above equation 
could be used to determine the variation of the current J, during 
commutation. But, in fact, the interpole winding is not arbitrary. 
ti is designed with the specific object of ensuring that the 
ai mature current shall be precisely reversed during commutation. 
lt follows that eqn. (20) must only be another form, albeit an 
wafamiliar one, of the equation used by the designer to determine 
the number of turns and the dimensions of the magnetic circuit 
i the interpoles. It expresses the familiar fact that the ampere- 
firns on the interpoles must first counteract armature reaction 
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and then set up a flux sufficient to produce a voltage equal and 
opposite to the resistive and self-inductive voltages in the turns. 

From the point of view of operation as distinct from design, 
it is of interest to solve eqn. (20) on the assumption that the 
parameters are constant. The solution will be only approximate, 
since Ry, varies with current and M3, with position. The solution 
is 


Ig = Ae~RslEOt + (Mog — Myq)eoly;/Ry (21) 
The value of the constant of integration A is found from the 
fact that at the beginning of commutation, when t = — 47,, 


Substitution of the value of A thus found gives 
= (M34 ze M3,)wly3/ Ry 
— [1 + (Mo, — Mj4)(w/ Rg] Lo3e7 Bal EN +470) (22) 


For ideal commutation J, must equal +J,; at the end of 
commutation, i.e. when t= +47,. Making this substitution 
and solving for M34 gives 


Myy = Mig + (Rafw)(1 + e-RTelayy(1 — e- Reels) | (23) 


It is clear that this may be looked upon as an alternative 
formula for the design of the interpoles. If the value of the 
index of the exponential is small, a simpler formula may be 
found by expanding only as far as the first order of small 
quantities, giving 


Ta x. 


My, = Mg, + 2L,/6 . 


where 6 = wT. is the electrical angle of commutation, which is 
constant. On this approximate basis the equation of J, reduces 
to 


(24) 


Ig = (3 T )h3 


and commutation is linear. 

In general it is found that the effect of the non-linearity of 
R, is to tend to make commutation linear, even when the 
expansion of the exponential is not legitimate. 

For a well-designed machine, therefore, eqn. (19) for V4 
simply gives a correct description of the process of commutation. 
Since it otherwise contributes no information about the per- 
formance of the machine, it may be omitted from further con- 
sideration. Two important points remain to be mentioned. 
First, for a.c. operation the mutual inductance between the 
short-circuited turns and the field winding would appear to 
make perfect commutation impossible. Secondly, the analysis 
made above for the particular case when the brush thickness is 
equal to the commutator pitch may be extended directly to the 
more general case. The voltage of the short-circuited turns will 
now be given by a more complex equation, but it will still be 
this equation that determines the design of the interpoles. 


(5.2) The Armature Equation 
The armature equation has been given as eqn. (18): 


Vo3 = wMi3l, + [(Rp + R3) + (Ly — 2MQ3 + Ls)p)h3 
rae «(M4 a Mya)Iy 


Here, V5; is the externally applied armature voltage, and wMj3/; 
is the voltage generated in the armature by its rotation in the 
flux of the main field. The term (R, + R3) + (L mT 2M); + L3) 
simply represents the self-impedance of the armature and inter- 
poles in series. The negative sign of the mutual inductance 
arises from the fact that the ampere-turns of the two windings 
are opposed. Each of these terms is straightforward and none 
requires further consideration. 

There remains the final term —w(M3, — M34)J4, which repre- 


482 


sents the voltage generated in the armature and interpoles in 
series by the rotation past them of the flux produced by the 
currents in the short-circuited turns. During the interval of 
commutation this voltage changes sign with Jy. For a d.c. 
machine, the other quantities in the equation are essentially 
constant, so the importance of the term under consideration 
lies in its mean value rather than in its instantaneous value 
during the interval of commutation. The continuous change 
in J, will produce a ripple in the armature voltage, but this is 
usually unimportant. So far as the mean value of the term is 
concerned, it is clear that for linear commutation the mean 
value of J, is zero and therefore the term will have no net effect 
upon the armature voltage. 

Should the interpoles be too weak, however, then towards 
the end of the interval of commutation the current in the short- 
circuited turns will not be approaching its correct reversed value. 
Sparking does not necessarily follow, because the brush resistance 
may be sufficient to force the current rapidly to its correct value 
by the end of commutation. Nevertheless, the mean value of 
the current in the short-circuited turns will no longer be zero, 
and the effect will be to accentuate the normal falling of the load 
characteristic of the machine. Similarly, overwound interpoles 
tend to give the machine a rising characteristic. These pheno- 
mena are both known but not perhaps well known. The effect 
of the repetitive change of current in the short-circuited turns is 
therefore to produce a ripple in the armature voltage, but for 
linear commutation the net effect is zero. On the assumption 
of linear commutation the effect of the /, term is thus negligible 
and it may be omitted from the equation. Eqn. (18) therefore 
assumes the simpler form 


Va ail Reese aap) ls Cc») 


where R53 and L,3 are written for the resistance and inductance 
respectively of the armature and interpoles in series. 


(5.3) The Field Equation 


The remaining equation is eqn. (17), namely 
Vi = (Ry + Lip), + OM 733 + Mygply 


where V, is the voltage applied to the field and (R, + L,p)J, is 
the voltage drop due to the field’s self-impedance. Both these 
terms are straightforward and need not be considered further. 

The last two terms are the most interesting in the analysis. 
The term M,,4pl, represents the voltage induced in the field by 
the change of current in the short-circuited turns, which lie on 
the same axis, and wM/731,;3 is the voltage generated in the field 
by the rotation of the current-carrying armature. A little con- 
sideration shows that both these terms are relatively large, and 
cannot be neglected as second-order effects. For, in the term 
M,4pl, the component pl, is due to the reversal of the full 
armature current in the short interval of commutation, whilst 
Mj, depends upon the number of turns of the field winding, 
which is large. The main generated voltage is given by wMj3/,, 
and since J53 is normally much greater than J,, it follows that 
the term here, namely wMj}3/,3, must also be very large. As 
mentioned earlier, neither of these terms appears in the normal 
form of the equations of the machine. 

The solution to the difficulty lies in the fact that although both 
terms are large they are at all times equal and opposite. Their 
net effect is zero. This result, which is of fundamental impor- 
tance, may be demonstrated as follows. 

Consider a full-pitch armature coil connected to slip rings. 
Its mutual inductance with the field winding over a cycle is 
shown in Fig. 6(a). This quantity will be unaffected if the ends 
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Fig. 6.—Reaction on the field of a rotating current-carrying 
armature coil. 


(a) Mutual inductance. 
(6) Current. 

(c) Flux linkage. 

(d) Induced voltage. 


of the coil are connected to commutator segments instead of 
slip rings. Suppose that the turn is, in fact, part of the armature 
of the commutator machine. Then, assuming linear com- 
mutation, the current in the turn over the same interval will be 
as shown in Fig. 6(b). The voltage induced in the field by 


rotation of this turn will be given by d(M1)/dt. The product MI 4 


is shown in Fig. 6(c), and its differential, giving the voltage, in 
Fig. 6(d). Consideration of this curve shows that the positive 
peaks are produced by direct mutual induction, since they 


occur when M is constant and therefore represent Mdl/dt. The | 


integral of the induced voltage curve [Fig. 6(d)] is the curve MJ 
in Fig. 6(c), and the average value of the induced voltage over a 
cycle is therefore zero. Nevertheless, if the armature winding 
of the machine did consist of a single coil only, the reaction on 
the field given by the voltage waveform [Fig. 6(d)] would be 
appreciable. The reactions of a uniformly wound armature 
may be determined from the following considerations. 

The voltage induced in the field at any time by the reversal of 
current in the coils being short-circuited is given by the positive 
peaks of the curve in Fig. 6(d). The voltage being generated 
in the field at the same time by the rotation of any other set of 
armature coils depends upon the position of that set. For one 
particular set of coils, for example, the voltage might be given 
by the ordinate A in Fig. 6(d), and for the neighbouring set by 
the ordinate B. It follows that if the curve in Fig. 6(d) is divided 
up into equal strips of width 6, the mean voltage generated in the 
field at a particular time by the whole armature (less the short- 
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circuited turns) is the sum of the mean ordinates of all the strips. 
If these voltages are now multiplied by 8, the result is equal to 
the area of the negative parts of the curve. But the total area 
‘of Fig. 6(d) is zero, so the net voltage induced in the field by 
the currents in both armature and the short-circuited turns is 
zero. Fig. 6(d) now applies at all times. 

Returning now to eqn. (17), the term M,4p/4 corresponds to 
the positive peaks in Fig. 6(d); the term wMj{3/,3 corresponds 
to the negative parts. It follows that the sum of the two terms 
is zero at all times. 

Eqn. (17) therefore reduces to 


V, — (R, =— Lipl, ° . F : : (26) 


Thus the short-circuited-turn equation gives valuable infor- 
mation about the process of commutation, but makes no 
other contribution to the performance of the machine, and may 
therefore be omitted. The mean value of the short-circuited- 
turns term in the armature equation is zero, so it also may be 
omitted. Finally, the short-circuited-turns term in the field 
equation above is cancelled by another term. The object of 
eliminating the short-circuit row and column has therefore been 
achieved. There remain only eqns. (25) and (26), namely 


Vo3 = wM ily + (Ro3 + Lo3p)h3 
Vi Rye), 


which are identical with eqns. (11), the equations determined 
experimentally at the terminals. 

It is now possible to see whence the asymmetry of these 
equations has arisen. The non-zero asymmetric term wMj3/, 
[has as its reciprocal the non-zero term wMj3J>; in the field 
equation. Although this is a large term it is compensated by 
tthe term M,,4pl,. The reciprocal of this is the term Mj,,pJ, in 
the short-circuited-turns equation. This, however, disappears 
| because J; is constant. The result is therefore asymmetric 
because the currents in the windings have been considered and 
not simply the impedances. 

To make this clear, consider the simple case of a two-winding 
| transformer whose impedance matrix is symmetric, and suppose 
that the terminal voltages are manipulated in such a way that 
the current in the primary winding is sinusoidal with angular 
frequency w, whilst the secondary current is direct. Then the 
tterm My,,pI, becomes jwM,,/, whilst the reciprocal term 
| becomes zero, and the symmetry is lost. 


(6) TORQUE OF THE COMMUTATOR MACHINE 


Since the equations of the commutator machine in Sections 5.2 
-and 5.3 have been derived from those of a slip-ring machine, it 
| follows that the torque is given by eqn. (4a). When this equation 
i is applied to the voltage equations (16) it can be seen that dL/d0 
i is simply the set of coefficients of w in the matrix, that is 


This matrix is symmetric, so the following mathematical mani- 
-pulation becomes possible. The torque is given by 


T = 41LdLd0l = IMG + G)I + IAG — GI 


where 


Gi eh - | (M44 — My) 


and G, is its transpose. It will be seen that G + G, is equal to 
dL|d@, whilst G — G, is a skew-symmetric matrix, and its 
quadratic form with the matrix J is identically zero.® 

Evaluating the expression for T gives 


Tice: LG) a wie ao tore ECD) 


The short-circuited-turns row is here zero, and the component 
of torque due to the term on the short-circuited-turns column, 
when evaluated, is found to be —I53(Mj4 — M3,4)l4. For 
correctly wound interpoles the mean value of J4, and hence also 
of this component of torque, is zero. Under these conditions 
the term may be omitted. It must be appreciated, however, 
that underwound or overwound interpoles affect the net torque 
of a motor just as they affect the output characteristic of a 
generator. On the assumption of linear commutation, however, 
the term has no significance, so the short-circuited-turns row 
and column may be omitted from the matrix for G, giving 


(28) 


which is in agreement with the experimentally determined 
eqn. (12), and also with the accepted commutator-primitive 
equations. 

This completes the derivation of the true commutator primitive 
machine from the slip-ring machine. The equations have been 
derived upon the basis of a study of conditions during one 
interval of commutation only. But none of the quantities 
remaining in the equations depends upon this fact, and since 
events during one cycle of commutation are identically repeated 
in all other intervals, it follows that the resulting equations must 
be of general application. 

It is a simple matter to extend the analysis to cover the more 
general case of two stator and two rotor windings, and thus 
obtain eqns. (13). It may be mentioned that these four windings 
do not include the interpole and short-circuited turns of the 
main analysis. 


(7) EXPERIMENTAL RESULTS 


The comprehensive series of tests which was carried out to 
verify the analytical work served also to illustrate some of the 
problems associated with the measurement of the tensor 
quantities of machines. In classical machine tests it is normally 
only mutual inductances, short-circuit inductances and rotational 
inductances that are measured and the methods of measurement 
are well known and accurate. Tensor analysis requires the 
measurement of self-inductances and also the rate of change of 
inductance with position. 

The measurements must be carried out under conditions 
approaching as closely as possible those obtaining in the normal 
operation of the machine. For this reason, a.c. methods of 
inductance measurement are not likely to be of value on a d.c. 
machine with solid poles. High-frequency methods are almost 
certainly valueless, and experience has shown that it is difficult 
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(though not impossible) to obtain good results with normal 
50c/s methods. Ballistic methods were therefore adopted 
throughout. 


(7.1) Experimental Techniques 


Almost all the tests were carried out with the machine 
stationary. The brush-gear was removed and leads were soldered 
directly to appropriate commutator segments. 


(7.1.1) Mutual Inductance Measurement. 


The measurement of mutual inductance using a ballistic 
voltmeter is straightforward. The voltage induced in an open- 
circuit secondary winding by a changing primary current is 
given by V. = MdI,/dt. If a steady direct current be reversed, 
integration gives, for the flux linkage, 


(oa) +h 
y =| vadt = M| dh, = 2MI, 
0 —I; 
The ballistic voltmeter reading is directly proportional to Y, 
giving 
M=3tP/I 

It is necessary for accurate results that the period of the volt- 
meter should be long compared with the time-constant of the 
coil. A ballistic galvanometer with a period of 8sec was used, 
and the time-constant of the coils reduced by using high voltages 
in series with high resistances. 

One of the chief difficulties is with the main field and is due 
to the damping effects of eddy currents in the solid parts of the 
magnet structure. A preliminary series of tests with increasing 
voltage (i.e. diminishing time-constants) enabled a satisfactory 
final arrangement to be obtained. 


(7.1.2) Self-Inductance Measurement. 


The following method, which is believed to be original, was 
used for the measurement of self-inductance. The inductor 
was connected with three non-inductive resistors in the Wheat- 
stone-bridge circuit shown in Fig. 7. Using a d.c. supply, the 


H 


Fig. 7.—Circuit for ballistic measurement of self-inductance. 


bridge was balanced and the supply connections were then 
reversed. The final steady-state condition is again balance, but 
there is a transient disturbance due to the inductance concerned. 
Simple analysis shows that the inductance is given by 


_IVR+R, 
I Ry 
For equal ratio arms, the simpler formula L = Y/J is applicable. 


Some numerical results confirming the method ate given in 
Section 7.3.2. 


L 


(7.1.3) Circuits for Ballistic Inductance Measurement. 


One further point of technique must be mentioned. It was 
decided not to disconnect the armature conductors from their 


A 


te) 


D ARMATURE c 
INTERPOLES FIELD 


Fig. 8.—Circuit for measurement of mutual inductance between the | 


armature-with-interpoles and the short-circuited turns. 


risers. Tests had therefore to be carried out between different | 


parts of aclosed armature. Fig. 8 shows a circuit typical of those 
used. It is for the measurement of the mutual inductance 
between the armature-with-interpoles and the short-circuited 
turns. ABCDA represents the armature winding, shown for 
simplicity as a ring winding. The two parallel paths, AD and 


BC, of the main armature were supplied in parallel from inde- 3 


pendent sources with equal currents. The interpole winding 
carried a current of twice this value. The three supply connec- 


tions were simultaneously reversed through ganged reversing | 


switches R, and the flux linkage in the parallel pairs of short- 
circuited turns AB and CD read on the galvanometer, which was 


connected across the centre taps of two similar resistors S. To | 
simulate normal conditions of operation, the field winding | 


carried its rated current, which was not reversed. 


(7.2) Variation of Inductance with Rotor Position 


The machine tested was a 2-pole, 230-volt, 3kW, 1 400r.p.m. z 


d.c. generator with an interpole winding. 


The commutator segment thickness was 0:155in and the 1 
mica thickness 0:031in. The brush thickness was 0-375in, | 
and the number of armature coils short-circuited by each brush || 


set was therefore very closely two. 


The first series of tests was carried out to determine the } 
variations with rotor position of the sixteen inductance coeffi- 4 
cients of eqn. (5). The experimental results have already been 
given in Fig. 4. One modification to the actual results obtained | 


has been made in plotting the curves of this Figure. This was 


thought to be desirable because the width of the interpole shoe — | 
(0-75 in) was only slightly different from that of the rotor teeth | 


(0-625 in), and with 18 rotor teeth a pronounced 18th harmonic 


therefore appeared in the waveforms of all interpole inductance ‘ 
coefficients. This harmonic has been smoothed out from the 


curves in Fig. 4. 


Full notes on this series of tests cannot be given, but th 
following points are of interest. 


(a) The self-inductance of the short-circuited turns has the | 


expected trapezoidal form, but the maximum value occurs when the. 
turns lie on the interpole axis and not the field axis. It would 
appear that the leakage reluctance of the pole faces is less than the 
direct reluctance of the poles and yoke [Fig. 4(6)]. 

(6) When numbers of such turns are connected in series to form 
the main armature winding, the trapezoidal form disappears, and 


the self-inductance of the main armature differs from a pure sine | 


wave only by the limits of experimental error [Fig. 4(a)]. 


(c) The five curves of mutual inductance shown in Fig. 4 were | 


all measured in both directions. In every case it was found that, 
provided the currents were so adjusted that similar saturation 
conditions obtained, the two values of mutual inductance were the 


same. Most interesting perhaps is the mutual inductance between 


the armature and the interpoles. In Fig. 4(e) the 18th harmonic 
has been smoothed out. Fig. 9 gives the actual results in both 
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directions. It will be seen that the difference between the two sets 
of results is very slight. The test is a severe one for the thesis that 
mutual inductance is always reciprocal, since the interpole Winding 
is lumped whilst the armature winding is distributed and the inter- 
pole permeance is lumped whilst the armature permeance is dis- 
tributed. A further complication is the 18th harmonic. The 
reciprocity shown in this stringent test is considered conclusive. 


Mo3 AND 


-90° fe) +90? 
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ig. 9.—Comparison between the two curves of mutual inductance 
for armature and interpoles obtained from measurements in 
opposite directions. 


(7.3) Variation of Inductance with Current 


The second series of tests was carried out to measure the 
s saturation curves of inductance at the fixed rotor position 0 = 0°. 
|The inductances concerned are those given in eqn. (16). It is 
bconvenient to discuss the results in the same order as that in 
‘which the theory appears in Section 5. 


/(7.3.1) The Short-Circuited-Turns Inductances. 


The most important terms in the short-circuited-turns equation 
: are the self-impedance of the turns and their rotational inductance 
with the armature and interpoles. 

It was found that the self-inductance of the turns was inde- 
‘pendent of their own current and of the interpole-winding 
‘current, but was reduced by 63°% when normal current flowed 
‘in the field winding. This latter value, which represents normal 
‘operating conditions, is the value required. 

It should be pointed out that the reduction in the self- 
‘inductance is attributed entirely to saturation effects. If the 
‘inductance had been measured by normal a.c. methods the 
‘presence of the closed coupled circuit, which the energized field 
‘windings provide, would naturally lead to a reduction in the 
apparent inductance. The ballistic method used, however, 
depends in effect only upon the initial and final steady-state d.c. 
‘conditions and not at all upon the nature of the intervening 
‘transient. 

The inductance varied according to whether the two turns 
‘were in the same or different slots. A mean value considered to 


be adequate is 


Lye ie x 10-4 


It is difficult to predict the effective value of the turns 
resistance, since this includes the brush and contact resistances. 
it will shortly be shown that the value adopted is fortunately 
not critical. The resistance of the coils alone was found to be 
@ 0586 ohm. In the calculations below a value of 0:06 ohm 
fa.most certainly too low) was first taken and then a second 
ylue of 0-120 ohm (likely to be too high). In spite of the 


arbitrary factor of 2 in these values, there is very little difference 
in the results. 

The values of the rotational inductance between the turns and 
(a) the armature, (b) the interpoles, and (c) both interpoles and 
armature, were then measured for the three cases in which the 
coil sides lie in different positions in the same slots, and for the 
fourth case, in which they are in different slots. The effect of 
the field ampere-turns was again appreciable, and the tests were 
carried out with normal field current. In this test also the 
rotational inductance was found to be much lower when the 
coil sides were in different slots. The following mean value for 
the rotational inductance for the armature-with-interpoles and 
the short-circuited turns was obtained: 


M34 — My, = 2°12 x 10-3H/rad 


With these values, it becomes possible to estimate the variation 
of current in the short-circuited turns during commutation and 
hence to determine the efficiency of interpole compensation. 
At a speed of 1 500r.p.m. the value of w is 157rad/s. The brush 
short-circuits two of the 72 commutator segments, so the time 
of commutation is 1-111 x 10~4sec. When these numerical 
values are substituted in eqn. (22), the following equations are 
obtained: 


Ty = [5255655650 GT) 

I, = 122775 sea 371 5e7 2 GAT 
corresponding to the two arbitrary values of R, discussed above. 
Graphs of these curves over the interval of commutation are 


given in Fig. 10, from which it can be seen that the effect of Ry, 
is slight. 
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Fig. 10.—Calculated current during commutation for two values of 
effective short-circuit-turns resistance. 


The ideal value of M5, — M34 as given by eqn. (23) is 
1:99 x 10-3 H/rad for Ry = 0:06 ohm, and 2:06 x 10-7 H/rad 
for R, = 0:12 ohm. The approximate value from eqn. (24) 
is 1-96 x 10-3H/rad. These values compare with the measured 
value of 2:12 x 10~7H/rad. 
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These results show that the both the experimental techniques 
and the theoretical analysis are sound. 


(7.3.2) The Armature and Interpole Inductances. 

The rotational inductance coefficient between the field and 
the armature is important because it is the only one of the 
sixteen which appears in the operational impedance matrix of a 
simple d.c. machine. From the standstill ballistic measurements 
of this quantity the open-circuit characteristic of the machine 
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Fig. 11.—Predicted and measured open-circuit characteristic. 


shown in Fig. 11 was calculated. The same curve was obtained 
by an open-circuit test at normal speed. 

The following results obtained for the self-inductance of the 
armature and interpoles are of interest: 


Self-inductance of the armature alone, L3 0:0423H. 
Self-inductance of the interpoles alone, L> 0:0202H. 
Mutual inductance of armature with interpoles, M23 0-0119H. 
Mutual inductance of interpoles with armature, M3. 0-0115H. 


Predicted value, L2 + L3 — M23 — M32 0-0391H. 
Self-inductance of the armature-interpoles, measured 0-0391H. 


The agreement between the predicted and measured values 
confirms the validity of the bridge method developed for the 
measurement of self-inductance. The reciprocity of the mutual 
inductance is again noticeable. It will be noticed also that, 
although the interpoles overcome armature reaction in the 
commutation zone, yet this action is local. The inductance of 
the armature is greater than that of the interpoles, and the 
latter do not form a compensating winding. 

An important point arose in connection with these tests. 
Although the interpoles and main armature both lie on the 
quadrature axis, and although their self-inductance was inde- 
pendent of the currents they carried, it was found to depend 
critically upon the direct-axis field current, as shown by the 
graph in Fig. 12. The effect is attributed to cross-saturation. 
The wider applications of this are important, namely that it is 
unwise to assume (as is frequently done in synchronous machine 
theory) that direct-axis parameters are unaffected by quadrature- 
axis saturation and vice versa. It is not suggested, however, 
that these results are in any way typical of a normal salient-pole 
synchronous machine. 

The remaining quantity in the armature equation is the 
rotational inductance with the short-circuited turns. Measure- 
ments of this inductance gave a value close to that obtained 
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Fig. 12.—Variation of self-inductance of the armature-with-interpoles 
with field current, showing dependence of a quadrature-axis 
parameter upon a direct-axis quantity. 


in Section 7.3.1 for the reciprocal quantity. The term 
w(M5, — M3,4)l4 has the value 0-3332, and represents the 
amount by which the generated voltage of the machine is 
modified by the presence of underwound or overwound inter- 
poles. 
from linear commutation is so small that the effect of the term 
is negligible—O-6 volt on full load. 

Experience has shown, however, that for low-voltage, heavy- 


current machines with few interpole turns the term may be of | 


appreciable significance. 
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Fig. 13.—Voltages induced in field by armature, due to (a) generator 
and (5) transformer action. 


Fig. 10 shows that in the actual machine the departure — 
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(7.3.3) The Field Inductances. 


The mutual inductance between the short-circuited turns and 
the field was next measured, and from the results the voltage 
induced in the field by the continued reversal of current in the 
short-circuited turns position was calculated. The results are 
shown graphically in Fig. 13. It will be seen that this voltage 
weaches a magnitude of 1 880 volts at full load. 

Measurements of the rotational inductance between the 

rmature and field similarly enabled the voltage generated in 
ithe field by the rotation of the current-carrying armature to be 
calculated. These results are also plotted in Fig. 13. Here 
gain, for normal armature current, the generated voltage is 
very high—2000 volts. 
Since the argument of the equivalence of the transformer and 
enerator voltages induced in the field winding is one of the 
‘keystones of the theoretical analysis given earlier, it would have 
been gratifying if the two voltages plotted in Fig. 13 had been 
ddentical. They do, in fact, show a mean deviation of about 5%. 
his difference is, however, definitely attributable to saturation 
idifferences and to experimental error, and not to any fault in 
the theory. On the contrary, the close agreement of the curves 
ds considered adequate proof of the arguments in Section 5.3. 
Phe main armature consists of two sets of 34 coils as against the 
‘to sets of two in the short-circuited turns. For similar currents, 
therefore, the ampere-turns, and hence the saturation, will be 
iGitferent. 


(8) CONCLUSION 


The author believes that the merits of the unified-machine 
‘theory are such that it will eventually replace the classical theory 
altogether. It cannot do this, however, until it is quite compre- 
hensive; at present it ignores completely the process of com- 
mutation. It is hoped that this gap will be filled by the theory 
of commutation given in the paper, which may also, perhaps, 
shed new light on the mechanism of commutation. It certainly 
shows a physical relation between the slip-ring and commutator 
machines which is of intrinsic interest and which may be com- 
pared with the purely mathematical analogy between the two 
classes of machine developed by Park and stressed by Kron. 
(The conclusion to be drawn is that the performance of any 
electrical machine may be developed from the simple case of 
two coils with relative angular movement. 
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(11) APPENDIX 


If the relation between the actual and transformed currents is 
given by the matrix equation 


{Ee 


it can easily be shown®:® that, for invariant power, the relation 
between the voltages is given by 


V'=CV 
and that between the impedances by 
a Ord 


The appropriate form of C has been given in eqn. (7). 
The new currents are found from the inverse relation 


v=c 


which gives 


I, cos @ + Jy sin 8 


—I,sin@ + I,cos@ 


Since the inverse of C is equal to its transpose, a similar 
relation holds between the original and transformed voltages. 

At this point it is interesting to note that, if the rotor carries 
balanced sinusoidal currents and runs at synchronous speed, 


I, = \/2[I cos (wt — 6)] and I, = +/2[Jsin (ot — ¢)] 


where J is the r.m.s. value of the rotor currents and ¢ their phase 
displacement with respect to the rotor voltages. Since @ = wt + é, 
where 8 is the load angle, it follows that the transformed rotor 
currents are direct currents: 


I, = V/2[L cos (8 + ¢)] and Ig= —V/2[Isin(6 + ¢)] 
Similarly the transformed voltages are direct voltages: 
V, = V/2(V cos 6) and Vg= —/2(Vsin }) 


The two equations also show the physical significance of the 
rotor-current transformations. The rotor m.m.f.’s are resolved 
into their components along the direct and quadrature axes of 
the stator. Historically, this physical resolution of m.m.f.’s was 
developed first, and the mathematical transformation deduced 
from it. 

The transformed impedance matrix is easily found, if it is 
remembered that the operator p acts upon the product of all the 
terms to its right. 
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Matrix multiplication is not commutative, and the preservation 
of the correct order of the terms is essential. In the expression 
C,ZC, the p in Z does not act upon the @ terms in C, since C, 
comes before Z. It does, however, act upon the product of the 
@ terms in Z, the @ terms in C and the current terms. These 
latter terms must be understood, since whilst the expression p/ 
has significance the term p alone, which would otherwise occur, 
is meaningless. With this in mind the analysis is simple. 

First, 


R, + Lip 


M3p 
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its synchronous reactances.2>4 This is yet a further instance of 
! 


| 
! 


the close interrelation between commutator and slip-ring 


machines. 
It remains to justify the expression for the torque given in 
eqn. (10), which was of a different form from that of the slip-ring 
primitive. 
It can be shown that )L/0@ transforms according to the law 


(DL[d0)' = C,QL[20)C 


—M,psin 6 | 


R; cos 0 + (L, — L,)p cos 8 


—R; sin 6 — (L, + L,)p sin 0 


M,p cos 6 


For the second stage, the following simple identities involving 
the differential of a product are required: 


(cos 6 p cos 8 + sin 6 p sin #)I = (p)I 
(cos 6 p sin 8 — sin @ p cos 0)I = (dO/dt)I = (w)I 


Whilst the current does not appear in the impedance matrix 
its presence must always be understood. 
Secondly, then 


7 = OZE= 


where Ly = L, + L, and L, = Lz — Ly, as in eqn. (9). 


It is of interest at this point to apply the transformed equations 
to the case of a simple salient-pole machine without damper 
windings. The second row and column disappear, and since 
for balanced operation it has been seen that the transformed 
currents and voltages are direct, it follows that the p terms also 
vanish. Under these conditions the matrix voltage equation 
may be expanded to read 


VaR 
Vsin 5 = — wL,Icos (6 + ¢) + R3Isin (8 + 4) 
V cos 6 = — (1/4/2)wM, I, + R3Icos(5 + d) + wL gl sin (8 + ) 


The first equation is trivial. The second and third apply to 
motor operation. If the signs of J and, for convenience, of I, 
are reversed to apply to generator operation, it is easily seen 
that the equations are satisfied by the normal voltage (rotating 
vector) diagram of the salient-pole alternator drawn in terms of 


R; sin 0 + (L, — L,)p sin 8 


R; cos 8 + (L + Ly)p cos 0 


and that by direct evaluation (p is absent) 


QL/d8)’ = 


It will be noted that, since p is absent from the expressions, — 
this torque matrix is still symmetric. It also contains non-zero — 


terms in the stator rows. 
The alternative expression is derived as follows. The coeffi- 
cient of w in the transformed impedance matrix is 


Now, 


IG’ = 3G + Gyr’ + 4G — Gyl’ 


But (G — G,) is a skew-symmetric matrix, and therefore its 
quadratic form with I’ is identically zero, as may easily be 
proved by direct multiplication. Furthermore, since 


(G + G) = QL]d6y’ 
T = U,GI' 


gives the torque. This result could equally well be achieved by 
consideration of the power balance of the transformed equations. 
The G form of the torque has the advantage that it may be 
written directly from the transformed equations, without the 
necessity of a separate transformation for 0L/06. It has the 
disadvantage that it does not appear symmetric and therefore is 
not convincing without the above justification. 


it follows that 
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SUMMARY 


The paper summarizes the basic concepts of the sequence-transform 
.theory, particularly emphasizing the non-physical form of samples 
aassumed for mathematical convenience. In dealing with inverse 
‘transforms, it introduces some typical transient modes due to poles of 
| the transform function Y(z). 

It is shown that the root-locus method is well suited to the design 
(of sampling control systems; also that the passage of time can be 
‘described by a system of moving zeros in the complex z-plane. This 
provides a complete picture of the response of a system to a given 
input, both at the sampling instants and between them. 

Barker has suggested a particular form of stabilizing network which 
has the advantage of introducing a modifying function U(z) which is 
independent of the rest of the system. This work has been extended 

y showing the limitations of the method and by giving a table of the 
(networks required to realize some functions U(z). 
Finally, two examples are worked out in some detail to illustrate 
| the theory, and a brief description of an experimental unit is included 
{ far completeness. 


LIST OF SYMBOLS 


‘In general, capital letters are used for sequence-transform 
«expressions, and lower-case letters for the corresponding time 
{functions and their Laplace transforms. 


A(m) = Coefficient of partial fraction term of G(z, m). 
a, B = Reciprocals of time-constants. 
a, b = Positive constants, zeros of U(z). 
b a serr, 
c,d. =— Poles:of U(z). 


a=eEe>*%, 


6 =A delay measured as a fraction of a sampling 
interval O<6< 1. 
6(t — nt) = A Dirac impulse of infinite height and unit area 
at t =n. 


é = Base of natural logarithms. 
F, f = Driving function (forcing function). 
f*@® =f@ sampled. 
G, g = Output function. 
H, h = Subsidiary feedback transfer function. 
K = Frequency-independent open-loop gain. 
m = Inter-sample time parameter measured as a frac- 


tion of the sampling interval O< m< 1, 
m=1—6. 

n= Sample number, the sampling instants being 
Oya Cary b oO, & 


_ O[ Y()] = Order of Y(2), i.e. difference between the degrees 
of numerator and denominator of Y(z). 
p = Laplace transformer operator. The Laplace trans- 


form is defined by f(p) = i f(De-P'dt. 
0 
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apd Technology, University of London. 


q =A positive integer. 

R(z, m) = Ratio of modified to ordinary pulse transfer func- 
tions, = Y(z, m)/ Y(z) or W(z, m)/W(z). 

S =A sampling device producing Dirac impulses of 
area proportional to the instantaneous value of 
the input signal. 

famines 

7 = Sampling interval. 
of the paper. 

V,v; U, u = Two cascaded filters. 
U(z) = Effect of subsidiary feedback on open-loop transfer 
function. 
v(p) = h(p)/p. 
W, w = Open-loop transfer function, the corresponding 
impulse response. 
Y, y = Overall transfer function, 
impulse response. 

z = The sequence-transform operator. 

is therefore an advance operator.) 
6, 6 = Two cascaded filters separated by a sampler. 

A = An integer measuring a delay in terms of sampling 

intervals. 


Used as unit of time in most 


the corresponding 


(z = €?* and 


(1) INTRODUCTION 


There has recently been considerable interest in sampling 
servo systems.”»3:4 The reason is that in many practical cases 
the information used for feedback is only available at discrete 
instants. This could be inherent in the system, as in a chemical 
plant containing an automatic analyser operating on a batch 
basis. Sometimes it is introduced intentionally, usually in order 
to facilitate time-sharing of equipment. This is particularly the 
case if costly equipment such as computers is involved, but other 
factors, such as weight, may be the determinants. 

An excellent survey of the many methods available for treating 
sampling systems is given in Reference 1. The present paper is 
only concerned with the sequence-transform method, and in 
particular with working in the complex z-plane. The basic 
theory used in the paper is described by Barker, but a short 
summary is included here for completeness. Also included in 
the first Section are diagrams and a discussion of the responses 
due to different types of poles of the transfer function G(z, m). 

The application of the root-locus method to sampling-control 
systems is discussed in Section 2. Also shown is a method of 
picturing a complete response, both at the sampling instants and 
between them, in the complex z-plane by a system of poles and 
moving zeros. The method of stabilization by employing local 
feedback across the clamp suggested by Barker is discussed 
further, and the limitations of the method are given. 


(1.1) Sampling Systems 
A sampling system may be defined as a system whose input is 
applied at discrete instants and may be represented basically by 
the block diagram in Fig. 1. The input signal f(p) is first sampled 
by means of a device S, and the resulting sampled signal f*(p) is 
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Fig. 1.—Basic sampled system. 


applied to the filter proper y(p). In the sequence-transform 
theory the following assumptions are made: 


(a) The sampling is performed at regular intervals, 7, in time. 

(b) The sampling device S is such that its output is in the form 
of Dirac impulses of infinite height and of area equal to the 
instantaneous value of the input signal at the sampling instants. 
The sampling action may be represented by the equation 


iO) = x f(t)d(t — nz) (1) 
n=0 


where 8(f) is a Dirac impulse. The corresponding Laplace 
transform is 


f*(p) = 3 flae-P (2) 


n=0 

(c) The output of the system is considered at the sampling 
instants only by imagining there is a sampler in series with it. 
As will be shown later, no information need be lost owing to 
this limitation. 

(d) The system is assumed linear. In this context, linearity is 
defined by the superposition property, and the sampler therefore 
falls within the definition. 


(1.2) The Sequence Transform 
Sampling a function f(t) according to the definition in Sec- 


tion 1.1, the sampler output is given by eqn. (2). Making the 
substitution 
Za eee (3) 
the equation becomes 
Fe) = fra)e-” (4) 
which is defined as the sequence transform of f(t). The sum- 


mation can usually be performed, giving the sequence transform 
in closed form. From the defining equation (3) it can be seen 
that z is the opposite of a delay, or an advance operator. 
The inversion integral corresponding to eqn. (4) can, by a 
transformation from the p-domain, be shown to be 
for) = 5b Fleets (5) 
Dea) 
the integration being performed round a large circle with centre 
at the origin of the z-plane. 
The sequence transform can be obtained by following the steps 


indicated above, or by using tables (see Section 8.1 or 
Reference 5). 


(1.3) The Pulse Transfer Function 


Just as the impulse response of a continuous filter is of great 
utility, so the response of a sampling system to a unit input will 
be found useful. A unit input is defined as a signal of arbitrary 
form, of unit amplitude at zero time and zero at all other 
sampling instants; thus : 


f(nt) = 1; (n=0) 
= 0; (n~0) 
or Eel 


REENSKAUG AND WESTCOTT: DESIGN OF SAMPLING SERVO SYSTEMS IN THE z-PLANE 


If the signal going into the sampler S (Fig. 1) is such a unit input, — 


its output, and therefore the input to the filter proper, will be 
a single Dirac impulse at zero time. The filter output will be 
its indicial response y(t). The sequence transform of this 
response is the pulse transfer function (p.t.f.) of the filter: 

eo 

¥(2) =X ynr)z-" (6) 

n=0 
As an illustration of a pulse transfer function, the simplest 
possible case of a sampler working alone will be considered. 
This corresponds to Fig. 1 with »(p) = 1. The response of such 
a system to a unit input is simply 6(f), giving an output sequence 
S(t), 0, 0, 0, ... The p.t.f. is therefore Y(z) = S(t). It is 
kept in this form since the amplitude of a Dirac impulse at zero 
time is open to argument. 


It can be shown that the response of a system with p.t.f. Y(z) 4 


to a signal whose sequence transform is F(z) is 


Gz) = F@Y@) (7) 


(1.4) Effect of a Pure Time Delay 
If the filter contains a pure time delay At which is an exact 


multiple of the sampling interval, the effect of this delay is easily — 


shown to be a multiplication of the p.t.f. by a factor z~*: 


YG ¥ yonr —Ar)z—"=z-4 S yor) == V4 NAD) (ts) 


If the delay is a fraction of the sampling interval, 57 say, the 


new pulse transfer function may be written 


Y(z, 6) = x y(nt — dr)z—" (9) 
n=0 
or, ifm = 1 — 6, 


Y(z,m) =z~! >) pnt + mr)z-" . 
n=0 


where m is an inter-sample time parameter starting at zero ata | 


sampling instant and increasing with time to 1 at the next 
sampling instant. 
useful for determining the inter-sample response of sampling 
systems. 
Section 8.1 or in Reference 5. 


(1.5) Cascaded Networks 


As pointed out by Barker,° the p.t.f. of two filters U(z) and | 


V(z) in cascade, formally written [VU](z), is only equal to the 
product V(z)U(z) if the two filters are separated by a sampler. 


The reason for this is that V(z) is the p.t.f. corresponding to v(p) : 


preceded by a sampler (see Fig. 1). 


Sometimes the network v(p) may be split into two parts q 


separated by a sampler. Denoting these two parts by 6(p) and 
f(p), the p.t.f. [VU](z) is equal to @(z)[DU](z). The operation 
of lumping the part $(p) with the following filter and then 
finding the p.t.f. of the combination may be written as an opera- 


tional instruction thus: @(z)*4(p), leaving the second filter 
unspecified. 


(1.5.1) The Clamp. 


The impulsive output of the sampler assumed in the theory 
must in some way be smoothed to be practically realizable. 


Commonly used for this purpose is the clamp, which is a device . 


that has a constant output throughout a sampling interval and 
adjusts this output to be equal to the instantaneous value of the 
input signal at every sampling instant. 
is illustrated in Fig. 2. The clamp may be looked upon as 


(10) | 


The modified p.t.f. Y(z, m) will be found very | 


Tables of the modified transforms may be found in | 


Such a clamping action — 
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CLAMPED SIGNAL 


Fig. 2.—Clamping action. 


Forming a step function at a sampling instant and another of 
equal magnitude but opposite sign one sampling interval later. 
Whe formation of the step function at a sampling instant can be 
represented by a sampler, forming a Dirac impulse, followed by 
a pure integrator forming the step. The negative delayed step 
nay be obtained by delaying the input to the sampler by 7 and 
nverting it. This can be done by a unit having the p.t.f. 
—I/z, since 1/z is a delay operator. The complete clamp is 
therefore represented by a unit (1 — z~') followed by a sampler 
and a pure integrator as shown in Fig. 3. This Figure also 


anal is ar eaak 


(a) (b) (c) As (d) 
z-1 ej 


F(z)=1 Fd) Zh 


(a) (b) (cy (d) 


ema ee 


Fig. 3.—Partitioning of clamp transfer function. 


Waveform (a) represents unit input. 
Waveform (d) represents clamp output. 


shows the waveforms at different points in the clamp due to a 
anit input. The clamp may now be described by the operational 
nstruction (1 —z~!)*p~!. It is shown in Section 2.3 that the 
factor (1 — z~') may be modified without altering its multi- 
plicative property. 


(1.6) The Inverse Transform 


A given sequence-transform expression G(z) may be inverted 
joy the inversion integral, eqn. (5), or by using Tables of trans- 
form (see Section 8.1 or Reference 5). Partial fraction expansion 
is used where necessary. G(z) may also be expanded in a series 
of z—!; the coefficients of this series is equal to the output 
sequence. The coefficient of z° is the value of the response at 
” = 0, while that of z! is the value at n = — 1 or t = —r. 
Since a physical system can have no response before the input 
is applied, it follows that the order of the p.t.f. must be zero 
yor less. 


(1.6.1) Response Due to Poles of G(z, m). 

The root-locus method of design is discussed in Section 2.1. 
The usefulness of this approach depends to a large extent on a 
kuowledge of the transient modes due to individual poles in the 
zplane. Since they are different from the modes of continuous 
Systems, they are discussed here in some detail. The pulse 
transfer function Y(z,m) is assumed expanded into partial 
fsactions. The denominators will not be functions of the inter- 
sample time parameter m, since the modes of the system are time- 
‘a iependent; but the coefficients will be, and they determine the 
if, er-sample response of the system. 

fot. 105, ParT C. 
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(a) Simple Real Positive Pole. 


The unit input response due to a pole of the form A(m)z/(z — a) 
for a> 0 is very similar to the response due to a negative real 
pole of the Laplace transform. The response is A(m)a”, i.e. 


an exponentially varying sequence as shown in Fig. 4(a). Since 
Ae 22 ae 
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(C) 
Fig. 4.—Response due to poles of G(z). 


Zz 


(a) GZ) = Saar ou =ar;az>0 

@ G@) =: a) = (—ira"; a > 0 

EG) z2 — az = 0+ a’ Als eis ae a e ae 
(d) G@) = Ee en) = nani 


a is positive, the sequence will increase monotonically for a > 1 
and decrease monotonically fora<1. If a =0, the response 
will become zero within one sampling interval. The form of the 
response between the sampling instants is completely determined 
by A(m), its amplitude being weighted by the exponential term a”. 
The dotted lines joining function values at the sampling instants 
in Fig. 4 have therefore no physical significance. 

If the partial fraction expansion term is divided by z, i.e. if 
the term becomes A(m)/(z — a), the effect is to delay the response 
by one sampling interval (see Section 1.4). 


(6b) Simple Real Negative Pole. 


The response in this case, shown in Fig. 4(b), is identical to 
that for a positive real pole but for the factor (—1)”, which makes 
the sequence oscillate between positive and negative values. The 
appropriate sequence-transform expression is A(m)z/(z + a) for 
a > 0, and the response is A(m)(—a)”. 


(c) A Pair of Complex Simple Poles. 


A typical term of the partial fraction expansion is here 
A(m)z|(z2 — 2az cos @ + a?), the poles being at z = ae+/°, The 
response is g(n, m) = A(m)a"—! (sin n6)/(sin 6). The sequence 
g(n) has been plotted in Fig. 4(4) for different values of @ at 
a= 1/2. Here again the inter-sample response is determined by 
A(m), and altering the power of z in the numerator will shift the 
response to the right or left. 


(d) Double Real Pole. 


The term in the partial-fraction expansion could be 
A(m)z/(z — a)*, the corresponding response being g(n,m 
= A(m)na"—!, The sequence g(n) has been plotted for) 
different values of a in Fig. 4(d). A different power of z in the 

yi 
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numerator of the partial-fraction term results in a shift of the 
complete response. 


(1.7) Stability 


As has been brought out qualftatively by the above inverse 
transforms, the stability criterion in the z-plane is that no 
poles of the pulse transfer function Y(z, m) can lie outside the 
unit circle with centre at the origin. By the defining equation 

= ¢?*, this unit circle is the conformal representation of the 
imaginary axis in the p-plane. The stability criterion is therefore 
strictly analogous to the criterion in the p-plane that no poles 
of the transfer function may lie in the right-half plane. 

Note that the stability criterion must be applied to the com- 
plete transfer function Y(z, m) and not to the transfer function 
describing the system at the sampling instants only, Y(z). If 
the latter criterion only is satisfied, there may be oscillations in 


the steady state which have nodes at the sampling instants. An 
example of this is given in Reference 6. 
(2) CLOSED-LOOP SYSTEMS 
A single-loop sampled servo system is shown in Fig. 5. Its 


transfer function is similar to the one for continuous systems, 
namely 

KW(z) 
‘eo = (1 
This transfer function represents the system at the sampling 


instants only. In order to obtain an expression which describes 


Fig. 6.—System with hypothetical delays for determination of the 
inter-sample response. 


the system at all times, a delay may be cascaded with Y(z) as 
described in Section 1.6. A better method is due to Barker,> 
and is shown diagrammatically in Fig. 6. A fictitious delay d7 
is inserted in the forward path and an equal and opposite delay 
in the feedback path. The loop then remains unchanged, but 
the sequence as evaluated at the output terminals may be made 
to represent any point between the sampling instants by varying 
6 =1-—~m. The pulse transfer function of such a system is, 
from eqn. (11), 


_ KW(z,m) 


NEU ON Sey gi 


(12) 
the loop being closed with unity feedback for simplicity. 

If the filter w(p) can be regarded as being partitioned by a 
sampler, e.g. as the clamp in Fig. 3, the delay 67 modifies the 
last part, [DU](z), of the p.t.f. only. This is evident, since the 
delay is introduced at the end of w(p) and therefore does not 
affect the signals passing through the sampler. 
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(2.1) The Root-Locus Method 


The root-locus method, well known from the design of con- 
tinuous servo systems by Laplace transforms, is admirably suited ] 
to the synthesis of sampled-data systems in the z-plane. The * 


method of constructing the loci is identical in both cases, but 


the conclusions that can be drawn from them are unfortunately 


different. 


The closed-loop transfer function of a sampling servo system * 


Assuming unity feedback, it becomes 


KW(z) 
1+ KW(z) 


is given by eqn. (11). 


Wa) = 


Its characteristic equation may be written 1 + KW(z) =0, or | 
in vector form KW(z) = 1/7 + (2q+ 1)7, where q =0, 1,2,3,... } 


The root-locus method then consists in plotting the loci satisfying 


the phase part of this equation, and marking along them the , 


values of K which make the modulus unity. 
A thorough discussion of the root-locus method as applied 
to Laplace transform expressions has been given by Evans.’ For 


completeness, a few rules for the construction of the loci are | 


given here. 
(a) A complete root-locus always joins a pole to a zero; the roots 


are at the pole positions for kK = 0 and traverse the loci travelling ' 


towards the zero positions as k increases to infinity. 


(b) The number of separate loci equals the number of poles : 


of W(z). 
(c) If the order-of W(z) is —r, i.e. if there is an rth-order zero at 


infinity, there are r asymptotes at angles +180°/r, +540°/r, +900°/r, ' 


LG: ' 
(d) The asymptotes intersect at a point on the real axis, not. / 


necessarily the origin. 

(e) The loci include those sections of the real axis which are to 
the left of an odd number of poles and zeros of W(z). 

(f) The point of departure from the real axis can be found by 
considering a point slightly displaced from it. 
phase angle must be zero at the point of departure. 


(g) The loci will always be symmetrical about the real axis since 


complex poles can occur only in conjugate pairs. 


If one is studying the effect on the root-locus of altering the \ 


pole-zero pattern, it is often convenient to bear in mind the 


analogy between a two-dimensional flow field and the root-locus. : 


A description of this analogy is given in Reference 8, but for a 
qualitative idea it is sufficient to consider the poles as sources and 
the zeros as sinks. The constant-angle loci, of which the 180° 
locus is a special case, are then represented by the flow lines. 


(2.2) Determination of Pole-Zero Pattern for Modified P.T.F. 


There has been a tendency in the past to analyse sampling 
systems in two separate stages, first considering the response at 


the sampling instants and then the inter-sample response. This 
is a reasonable procedure when dealing with servo systems, since | 


the loop is closed at the sampling instants only. It would be 
illuminating, however, if it were possible to treat the system as a 
whole, finding some way of describing it at all times by a con- 
tinuous process. This can in fact be done by using the Y(z, m) 


notation and describing the response by a system of poles and | 


moving zeros in the z-plane. 


The equation in z and m describing the system in Fig. 6 is | 


eqn. (12), which may be written in the form 


KW(z) 


WED) SIRE ™ > KWO 


where R(z,m) = W(z,m)/W(z). The poles and zeros of the 
latter part of eqn. (14) are found by the normal root-locus ~ 
In addition, there are the | 
poles of R(z, m), which are independent of K and also m since © 
Finally, there { 


method as outlined in Section 2.1. 


the modes of the system are time-independent. 


(13) | 


The net change in | | 


(14) | 
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ware the zeros of R(z,m) whose movement will determine the 
iinter-sample response. 

If, as is common in practical systems, the response is continu- 
ous, it will be found that as m-> 1, a zero of R(z, m) tends to the 
origin, and as m->0, a zero tends to infinity. The inversion 


integral is eqn. (5): g(n) =(1/27j) @ G(z)z"—1dz. Owing to the 


‘factor z"—!, a zero is added at the origin at every sampling instant. 
[This may be identified as the zero of R(z,m) moving to the 
origin as m-— > 1. In other words, passage of time may be repre- 
sented by a system of moving zeros in the z-plane, one arriving 
nat the origin and becoming stationary at every sampling instant, 
sand another coming into the finite plane from infinity so that 
the number of moving zeros is always the same. Since only one 
pzero arrives at the origin at each sampling instant and complex 
zeros can only occur in conjugate complex pairs, one would 
pexpect the zeros to move along the real axis. 

If there are discontinuities in the response at the sampling 
instants, a zero will arrive from infinity into the finite plane at 
peach such instant, and a zero will jump from some point to the 
origin. This jump is to be expected, since a discontinuity in the 
response can only be achieved by a discontinuous movement of 
at least one zero. 


(2.3) Stabilization 


The simplest way to stabilize a system in terms of the root- 
(iocus is to add poles and zeros to the complex z-plane in order 
!te shape the locus into a desired form. Additional poles and 
zeros are unfortunately difficult to obtain because the p.t.f. of 
two cascaded networks is not, in general, equal to the product 
hof the individual p.t.f.’s (see Section 1.5). 

If a clamp is used, however, there is a particular type of 
‘stabilizer due to Barker? which overcomes this difficulty. It 
consists of a subsidiary feedback across the clamp, as shown in 
FFig. 7(b). The clamp is here shown in the partitioned form 


CONTINUOUS SIGNAL 
lA 


ayy eae SIGNAL 


Fig. 7.—Two equivalent block diagrams of stabilized system. 


described in Section 1.5.1, and is equivalent to Fig. 7(6) with the 
integrator 1/p inserted in the feedback and forward paths 
‘separately. Because of the feedback, the operational instruction 
ef the clamp can be modified to 


z—1 


Wee Ppa) eel eal) 
P 


where U(p) represents a modifying factor due to h(p) which can 
be: inserted without affecting the pulse transfer function of the 


rest of the system. me 
Considering the subsidiary feedback loop, the modifying 


function U(z) is given by eqn. (11), where W,(z) = oe and 
Z 


[W,W|z) = foceh V(z), since the two networks are separated 
by a sampler. 
Zl 
= | 
é Ge) = = ; where v(p) = AD) 
1 + —YV(z) i 
U(z) = é ee GG 
@) Ze al A(z u) 
z 1—U() 
Vi = 
and (z) aoe ae (17) 


The feedback across the clamp does not alter the multiplicative 
property of its pulse transfer function, since the unit (z — 1)U(z)/z 
is still separated from the rest of the filter by a sampler. The 
pulse transfer function of the modified clamp in operational 
instruction form is therefore eqn. (15). 

In Section 1.3.1 it was shown that the order of the pulse transfer 
function must be zero or less. Since both U(z) and V(z) must 
obey this rule, it follows from eqn. (17) that U(z) is restricted to 
be of zero order. Poles and zeros may therefore be added to 
the root-locus at will in equal numbers. In addition there is, 
of course, the normal condition that complex poles or zeros 
must appear in conjugate pairs. The expressions for h(p) tend 
to become rather complicated, and for this reason a short table 
of U(z), h(p) and networks realizing h(p) has been compiled and 
is given in Section 8.2. 

As an example of a modifying function, consider U(z) 
= (z — a)[(z —d), wherea > 0. By eqn. (17), 

se ek 

Zz z—-d a-—-d (1—az 

z—-1l z—-a  1-—a(z—1)@—a) 
z—d 


From Section 8.1, the corresponding expression for v(p) is 


Up) = — rin ay where a = log, (1/a) 
a—d « 
or hp) ee) ee ee 


which can immediately be realized by network (a) in Fig. 17 if 
the gain K = (a — d)/(1 — a) and a = log, (1/a) = 1/RC. 


(3) EXAMPLES 

In this Section two examples of sampling servo systems are 
worked out in detail. The first example, a clamped single-lag 
system, is very simple and could easily be done by purely 
analytical means. It has been done by a combination of 
graphical and analytical methods here, however, in order to 
illustrate the complementary nature of the two methods. The 
second example is a pure double-integrator system which is used 
to illustrate the method of stabilization described in Section 2.3 
and a graphical procedure for determining the response of the 
system. 

(3.1) Single-Lag System 

The system block diagram is shown in Fig. 8. The operational 
instruction for the clamp is given in Section 1.5.1 as (z—1)/z*1/p. 
The factor 1/p lumped with the lag term Ka/(p + «) is 


Ka 
P(p) up) = 2 54g) 
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Fig. 8.—Single-lag system. 
(a) Block diagram. 
(6) Root-locus diagram. 


From Section 8.1, the corresponding p.t.f. is 


ou irre a (a oad 
[®U|(z) = (1 Mere = oy SN GEeay where _ ne 
and INGE ip) = ob see) es (Gave) 
(d —a)z 
The open-loop transfer function is therefore 
z—1 Z 
W(z, m) = R(z, 1 K 
pow 2 m)( up Zz @— Nea) 
= R(z, ieaiees 


The root-locus diagram corresponding to W(z) is easily plotted 
by the rules given in Section 2.1, namely 


(a) The locus starts at the pole of W(z) at z = a and terminates 
at the zero at infinity. 


(b) There is only one locus, since there is only pole of W(z). 

(c) There is one asymptote at 180°. ie = 

(d) Irrelevant. 

(e) The locus includes the part of the real axis to the left of z = a. 
(f) There is no point of departure from the real axis. 


This is sufficient information to plot the locus [see Fig. 8(5)]. 
The value of gain is given by the modulus condition 
\(d —@KW@)| =1 or K=| —a/a — a)| when z is on the 
locus. The system is on the verge of instability when z = — | 
or K =(1 + a)/(1 — a). In this example, « is taken as 23 or 
a=eé *'3 = 0-1, giving a maximum gain of 1-22. 

From the root-locus diagram, a closed-loop pole is chosen 
which is expected to give the desired response, and the corre- 
sponding value of gain is evaluated. Let the pole be at z = Z1 
and the gain be K = K;. The step response of the system is 
given by eqn. (7): G(z,m) = F(z) Y(z, m), where F(z) = z/(z — 1). 
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oes. Ga Ge eOeah sos 
z—1 (1 —a)z Z—2Z 
(1 — a™)z — (a — a") 
(2D) Gan) 


This expression does not appear in Section 8.1 as it stands, but _ 
by splitting it into a sum the individual parts may be found; thus | 


(l= 24)z 
C= YG 2) 


— a) 


(CAO) = i = la a”) 
rea 


(a 1 —= Zi | 
Z— 1)¢— 2) um 
The first term appears directly; the second is of the same form, | 
but delayed one sampling interval in time; thus | 


g(n, m) = slo —a™)(1 — 2") —(a—a™(1 — zal | 


cs laa {(a = @) =a" = 7) 1a — z)\-1} | 
77. 


Alternatively, G(z,m) may be expanded into partial fractions 
and the inverse transforms found for the individual fractions | 
which will yield the same result. 

The response can be seen to.depend on two time-constants; © 
the open-loop time-constant 1/« determines the inter-sample » 
response, and the closed-loop time-constant 1/(log z;) determines ~ 
the response at the sampling instants. This is fully to be | 
expected, since the loop is effectively closed at the sampling. 
instants only. The response for K,; = 0-67, z; = — 0-5, has | 
been plotted in Fig. 9 (a = 0-1). | 


ine) 


g(n,m) 


Fig. 9.—Step-function response of single-lag system. K = 0-67. 


(3.2) Double-Integrator System 


The block diagram for a pure double-integrator system with | 
clamp iS Shown in Fig. 10(a). Its open-loop transfer function © 
is found in the same way as before, and is 


K z+1 
OS ae ty 
RO n?z* + (1 + 2m — 2m*)z + (1 — m)? 


z(z + 1) 


The numerator of R(z, m) is in the form of a parabola in m4 
indicating that this is the form of the inter-sample response. 
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Fig. 10.—Double-integrator system. 


(a) Block diagram. 
(b) Root-locus diagram. 
(c) R(z, m) diagram. 


‘This could of course be predicted from physical considerations 
jot the system. R(z, m) has two zeros moving along the negative 
real axis, one from infinity to (—1) and the other from (—1) to 
the origin in each sampling interval. A diagram showing the 
poles of R(z,m) and also the locus of its zeros is given in 
Fig. 10(c). In the z-domain, passage of time is represented by 
this movement of zeros coming from infinity and accumulating 
at the origin. For a complete representation of R(z, m), the 
coefficient of z* in the numerator must be taken into account 
thus: R(z, m) = m?*(z — z,)(z — z,)/z(z + 1). 

The root-locus diagram corresponding to W(z) is given in 
Fig. 10(b). It consists of a circle of radius 2 and with centre 
at (—1 +0) and the negative real axis z< —1. The circular 
locus is never inside the stability circle and neither is the locus 
on the real axis. The system is therefore unstable for all values 
of gain. 

In order to stabilize the system the locus has to be distorted 
50 that it is in the stable region for at least some values of gain. 
This can be done by adding a zero inside the locus on the positive 
real axis, its effect being to pull the locus towards it. The pole 
which has to be added, together with the zero, can be placed 
somewhere on the negative real axis in the stable region, resulting 
in a modifying function of the form U(z) = (z —@/(z — d). 
The new system is shown in Fig. 11(@) and the resulting root-locus 
in Fig. 11(b) for a= + 0-6 and d= — 0:6. The necessary 
filter A(p) is given in Section 8.2. 

The open-loop transfer function now becomes 


K(z—az+1) 


WAI > iy ERS 


Riz, m) remains unchanged. 

A graphical method will be used here to determine the system 
response to a step-function input. One of the advantages of 
this method, is that, once the response for one set of parameters 
has been calculated, the necessary parameter changes to obtain 
the required response can be estimated. This is because the 
reative importance of the different terms of the inversion is 
cl-ar throughout the analysis. 
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(D) 
Fig. 11.—Stabilized double-integrator system. 


(a) Block diagram. 
(6) Root-locus diagram. 


The closed-loop transfer function of the system considered in 
this example will be of the form 


Ym) = pee ms e (z — a\(z + 1) | 
ri ee a) 2 (2 —23)(z2 — 2rz cos 6 + r?) 


(18) 


where the first term is R(z,m) and the second corresponds to 
the root-locus. 

The residue at a pole z = Zp is obtained by measuring the 
magnitude and phase angle of the vector joining the point zp to 
each of the other poles and zeros in turn, and then substituting 
the values of these vectors in eqn. (18). 

The complete pole-zero pattern required to find the response 
to a given forcing function is found by the superposition of three 
pole-zero patterns: (i) of R(z, m); (ii) from the root-locus plot 
and (iii) of the forcing function. Such a pattern is given in 
Fig. 12; it represents the system for K = 1-7 (7 = 0:59 and 


see 


O 
Z,=—0-2 


—+— 
(n-IFORDER 
OF ZERO AT 
THE ORIGIN 


Fig. 12.—Pole-zero pattern for calculation of step-response. 

§ = 47°), m=0°:46, and the input is a step-function, F(z) 
=z](z —1). The order of the zero at the origin is (m — 1), 
the third sampling interval has been chosen here for illustration 
The residue at the pole z = zg is 
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Pyar 3t? a 4 b> deer Ye By — n/2 — B3 


Ss 1 S983 == 


= 1°81 “— 21° 

The residue at the other complex pole is, by symmetry,1°81 Ue eae 
and the effect of the conjugate pair of poles is therefore 
1:81 x 2cos 21° = 3-39. The residues at the other poles are 
similarly found, and the value of the response at this point in 
time is the sum of the residues multiplied by the factor Km?/2. 
The value is 1-60. The response for the same value of m in 
other sampling intervals may be found by altering the order of 
the zero at the origin, and for other values of m by calculating 
the new position of the zeros of R(z,m). The complete response 
has been plotted in Fig. 13. 


Fig. 13.—Step-response of stabilized double-integrator system. 


(4) EXPERIMENT 


A short description of an experiment verifying the results of 
Section 3.1, concerned with a single-lag system, is included here 
as an indication of a simple experimental technique for handling 
sampling systems. 

The system was represented on a small electronic analogue 
computer, all elements being orthodox with the exception of the 
clamping device. This is in fact a time demodulator used for 
some radar work.!° The circuit diagram of the clamp is given 
in Fig. 14; its operation is as follows. On the arrival of a 


‘ 


TRIGGER 
PULSES 


INPUT OUTPUT 


Fig. 14.—A simple clamp. 
Ri = 4700 


R2 = 4700 


Ci = 0:01 uF 
Cs = 0:04 uF 
Diodes = 6H6 


trigger pulse, the four diodes are biased into the conducting 
region, connecting the storage capacitor C, to the input and also 
charging the capacitor Cj. When the trigger pulse is removed, 
the charge on C, will bias the diodes beyond cut-off, thus 
isolating the storage capacitor. The clamp has to work into a 
very high resistance in order to avoid droop of the clamped 


voltage; in this case it was working into an open grid. With a 
1:1 isolating transformer, and with no special precautions as 
to balancing of the circuit, the residuals of the trigger pulse | 
appearing at the output terminals were of the order of 0:5 volt, | 
while the signals were kept of the order of 35 volts. The trigger | 
pulses were obtained from a monostable multivibrator via a | 
cathode-follower stage, their amplitude being approximately ' 
100 volts. 

The system was connected as shown diagrammatically in 
Fig. 15. The gain was set by measuring the loop gain with the | 


AMPLIFIER 


i 


SQUARE - WAVE 
GENERATOR 


Fig. 15.—Analogue of a single-lag system. 


capacitor C open-circuited, and the time-constant I/a by 
measuring the gain reduction when the capacitor C was inserted, 
all measurements being done at sampling frequency. | 

The measured reponse was in complete agreement with the 
response calculated in Section 3.1 and plotted in Fig. 9. 


| 


(5) CONCLUSION | 


The sequence-transform method applied to sampling systems { 
is the logical analogy of the Laplace transform method used for ) 
continuous systems. It is therefore not surprising to find that | 
the development of the two methods follows very similar lines. ; 
Since, however, a sampling system has two distinct modes of | 
behaviour—one at the sampling instants and one between them— }/ 
two variables are necessary to describe it completely. This } 
increases somewhat the complexity of the algebra involved, but | 
does not make the fundamental conception any more difficult. | 
The variables chosen in the modified sequence-transform theory, | 
zand m, are mutually independent and no substantial simplifica- } 
tion can be expected by using any other pair of variables. | 

There seems to be no reason why further development should | 
not lead to a sequence-transform theory just as useful and / 
comprehensive as the Laplace transform theory. This involves } 
finding suitable design criteria in the complex z-plane. One | 
such criterion which has been suggested is the conformal repre- | 
sentation in the z-plane of the line of constant damping coefficient q 
in the p-plane. This transformation results in part of a logarith- | 
mic spiral, and, owing to the branch point at p = 0, a region of } 
the z-plane corresponds to a multiple of regions in the p-plane, \ 
with the resulting ambiguity of interpretation. The criterion | 
must therefore include some information as to system band- } 
width to have any meaning, and since this is not always con- { 
venient a better criterion should be sought. One possibility | 
would be a statistical design procedure. For continuous | 
systems, this involves correlation functions. Since these are | 
derived by time-shift considerations, and the operator z repre- | 
sents a time shift, the resulting sequence-transform expressions | 
might be simpler than the corresponding Laplace transform | 
expressions. This should be a rewarding field for further study. | 
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Table 1 
TRANSFORM PAIRS 


Ile (= Da ne + (1 —m)/m 
Zz e 


m2z2 + (1 + 2m — 2m2)z + (1 — m)2 
2(z + 1) 


qm 


s 


Ta ean oe UIC) 
P(p + &) (z@— DE —a) (iene 


B ae az sin B zsin (mB) + asin (1 — m)B 
(p + a)? + 2 Zn) z2 — 2az cos B + a2 as az sin B 


Table 2 


REALIZATION OF STABILIZING FUNCTION 


=(@ 352), a2 + 772 
IE Gh Grae 0 se ae 


(z — a) — b) (a—cfa—d)_ « (ci bb — a) eB 
(z — oe) — d) (a—bd)1—a) p+a (a@—b)\1 — 5) p+ 8 


(z — az + b) (a—c)\a—d) « —(b + c)\(b + dad) p2 + 772 
(Z—o)@—d) (a+ bl —a) pre GDL 8) eae Byte 


(z + a\(z + Bb) (a + co(a+ d) a2 + 72 (b + c)\(b + d) p2 + 72 
G SOG a) @ = al @) Cpe Pa 12 (Gi by) b) 5 eR) Arie Be 


(z — a\(z — 5) a2 — 2ad cos ¢ + d2 b2 — 2bdcos$6 + d2_ B _ 
z2 — 2dzcos ¢ + d2 (a — 5) — a) 4 (6 — a) — b) P+B 


(z—a\(z+ 5) a2—2adcos¢+d2 « b2 + 2bdcos ¢ + d2 p2 + 12 
z2 — 2dzcos ¢ + d2 (a+bd—-a pte = (Gaye by) (Pp) 


(z + az + db) @+2adcosé6+d2 «2+ 72 242bdcosd+d?2 f+ 72 
z2 — 2dzcos ¢ + d2 (b-—a(ita (p++ 7 (= 604 5) Oa pea 
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(8) APPENDICES 


(8.1) Pulse Transform Pairs 


A short list of Laplace transfer functions y(p) of the system 
in Fig. 1, together with the corresponding pulse transfer func- 
tions Y(z), is given in Table 1. The modified transfer functions 
Y(z,m) are given in the form R(z,m) = Y(z,m)/Y(z). The 
Table has been derived from the more complete list of trans- 
forms given by Barker.» The unit of time is taken as the 
sampling interval, 7 = 1. 


(8.2) Realization of Stabilizer 


By eqn. (17) and the Table of inverse transforms in Section 8.1, 
the (Laplace) transfer function of the filter h(p) may be found for 
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different functions U(z). This has been done for some first- and 
second-order forms of U(z) in Table 2. In this work, both U(z) 
and V(z) have been assumed stable, so that the poles and zeros 
of U(z) must lie within the unit circle in the z-plane. Further, 


since a = s~% and b = «~**, both a and b must be positive. | 


The restrictions on the constants are therefore 
0<a< 1°02) <1) Seca + 1 de 


The functions h(p) of Table 2 may be realized by paralleling 
networks as shown in Fig. 16; the only two types of network 


(Db) 


Fig. 17.—Circuits realizing h(p). 
RC = 1/a; Ky = (n/a)2; Ko = (1 + 1/K)K. 


needed to realize the functions are shown in Fig. 17. 
(Laplace) transfer function of the network of Fig. 17(a) is 


a 


pra 
and the transfer function of the network in Fig. 17(d) is 
a2 + 72 
(p + a) + a2 2g ] 


if K, = (nJa)? and Ky = (1 dk ) K. 
1 


The terms of h(p) in Table 2 may readily be associated with 


the terms of eqns. (19) and (20), thus realizing the networks. 


| 


521.313.33 


The Institution of Electrical Engineers 
Monograph No. 304 U 
May 1958 


© 


THE CIRCUIT THEORY AND CALCULATIONS OF POLYPHASE INDUCTION 
MACHINES 
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(The paper was first received 14th May, 1957, and in revised form 22nd January, 1958. It was published as an INSTITUTION MONOGRAPH in 


SUMMARY 


i) The paper examines in a systematic manner the nature of the various 
circuits and circuit parameters of both wound-rotor and squirrel-cage 
nduction machines. From these the basic voltage and torque equa- 
tons are deduced. Substitute variables are then introduced to enable 
che equations to be made to depend on the speed rather than the 
dosition of the rotor, thus making the differential equations linear with 
sonstant coefficients for the important case of constant angular velocity 
of the rotor. 

In order that the theory formulated may be of value in practice, 
methods of evaluation of the various circuit parameters from the 
physical dimensions of the machine are developed. For these the 
existing methods available for conventional design constants have had 
0 be considerably modified. The circuit parameters are computed 
‘or an actual machine and compared against test results. 


LIST OF PRINCIPAL SYMBOLS 


The M.K.S. system of units is used. The values shown in 
square brackets in this and succeeding Sections refer to the 
»xperimental machine discussed in Section 11. 


a, b, c = Subscripts representing phase windings, or 
circuits, of the stator. 

ri, r2...rS = Subscripts representing any of the S circuits, 

or phases, of the rotor. 
re = Subscript representing the end-ring circuit of a 
squirrel-cage rotor. 

U,55 Ure = Voltages applied to 

denoted by a, b, c, rl... re. 

Usay UsBy Uso = Substitute variables for stator voltages. 

Vras Urs Yro = Substitute variables for rotor voltages. 

j i,s, ie = Currents in circuits denoted by a, 4, c, 

WAS gj 1ASSs (AZ: 
iss is8) iso = Substitute variables for stator currents. 
ives tp85 iyo = Substitute variables for rotor currents. 
®,, ©, O., O,; ... 0,5 = Fluxes linking circuits denoted by 
ASD NCa TA eeS 
®,., D,g, D,. = Substitute variables for rotor fluxes. 
R,, Ry, R, = Resistances of stator windings. 
Rrirt + +» RrsrSs Rrere = Resistances of rotor circuits denoted 
los NL 0 6 6 IS AZ 
Rian = Resistance mutual to circuits m and n, where 
m and n range over all circuits rl, 72... 
nS, re. 
Rye = Mutual resistance between the end-ring circuit 
and any one of the S circuits rl, r2... rS, re. 

Rix Rye; Ryo = Substitute variables for rotor resistances. 

Limn = Mutual inductance between circuits m and n, 
where m and n range over all circuits a, b, c, 
pb TP, 6 0 aS UA 

Linm = Self-inductance of circuit m. 


OPO EU Uris « the circuits 


igs ips los i, see 


Correspondence on Monographs is invited for consideration with a view to 
u! lication. < ‘ f ( 
hr. Kesavamurthy and Dr. Bedford are in the Electrical Engineering Department, 


ac /an Institute of Technology, Kharagpur. : ; } 
3 -r. Bedford is temporarily in the Electrical Engineering Department, University of 


¥ ois, U.S.A., as Visiting Assistant Professor. 


May, 1958.) 


L,s = Component of mutual inductance between any 
one of the stator and any one of the rotor 
circuits. 

= Mutual inductance between any one of the 
rotor circuits and the end-ring (re) circuit. 

Lye = Mutual inductance between any one of the 
stator circuits and the end-ring (re) circuit. 

Lia, Lg, Lo = Substitute variables for rotor inductances. 

h, = Radius of rotor surface [0-081 75 m]. 
hy = Inner radius of stator surface [0-082 5m]. 

S = Number of slots on the rotor [44]. 

1 OLGUC. 

g = Gap length [0-75 mm]. 

1, = Effective length of the stator or rotor, i.e. gross 
length corrected for ventilating ducts and end 
fringing [0-1185m]. 

p = Number of poles [4]. 

ds = Number of slots per pole per phase on stator [3]. 

s = Fractional slip. 

Ky, = Permeability factor. 
Kens = Breadth factor of the stator for the nth har- 
monic [sin 30n/3 sin 10n]. 

ns = Pitch factor of the stator for the nth harmonic 

{cos 20n]. 

k, = Carter’s gap-extension coefficient [1-2]. 

7 = Pole pitch as measured on the stator surface. 

@ = Angular displacement, in electrical degrees, 
from the a phase axis of the rotor circuit, rl. 

ys = Angle, in electrical degrees, between the axes 
of two adjacent rotor circuits. 

w = Angular frequency of the supply voltage to the 

stator. 

Ho = Permeability of free space [47 x 10~7H/m]. 

g@ = Angle between the axes of the actuating phase 
and a coil. 


L 


rre 


k 


P 


(1) INTRODUCTION 


Ever since its invention, the polyphase induction motor has 
been the subject of many investigations, notably with the aid 
of the revolving field concept and circle diagrams and equivalent 
circuits based on this concept. Treatment on these lines has 
proved extremely useful in visualizing and analysing steady-state 
performance of the machine. However, for the study of transient 
phenomena and other special problems this concept is inade- 
quate. With a view to removing this inadequacy there has been 
a growing tendency to analyse the machine from the point of 
view of the theory of coupled circuits. The works of Kron,! 
Koenig? and Lyon? are concerned to a large extent with the 
development of such a circuit theory for various classes of 
machines. However, a detailed circuit theory of induction 
machines, particularly those with squirrel-cage rotors, does not 
appear to exist in the literature. 

The paper examines in a systematic manner the nature of the 
various circuits and circuit parameters of both wound-rotor and 
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squirrel-cage induction machines, and obtains the basic voltage 
and torque equations for these types of machines. Substitute 
variables are introduced so that these equations can be made to 
depend on the speed rather than the position of the rotor, 
thereby enabling the differential equations to be linear with 
constant coefficients for the important case of constant angular 
velocity of the rotor. In order that the theory enunciated may 
be of value in practice, methods of evaluating the various circuit 
parameters from the physical dimensions of the machine are 
developed, taking into account the presence of slots, ducts, 
overhang and saturation. For these the existing methods have 
had to be considerably modified. The circuit parameters are 
computed for an actual machine from its dimensions and com- 
pared against test results. 


(2) ELECTRICAL CIRCUITS OF THE MACHINE 


The paper is confined to an induction machine having three 
phases on the stator. In the case of the wound rotor, the rotor 
is assumed to have S symmetrical phases wound for the same 
number of poles as the stator, whilst, for the squirrel cage, it is 
assumed that there are S equally spaced bars on the rotor. 

The three phases a, b and c may be regarded as constituting 
the three stator circuits, and the currents in these are designated 
ig, iy, ip Tespectively. 


For a slip-ring induction motor the S phases rl, r2... rS, 
constitute the S rotor circuits. The various rotor currents are 
designated i,,, i... . i,s. 


For a squirrel-cage winding consisting of a single cage with 
S bars, the number of circuits necessary can be formulated from 
the following considerations. The number of branches in this 
winding is 3S, whilst the number of independent nodes is 
(2S — 1). Consequently, the number of independent loops is 
(S +1). For the purpose of formulating a circuit theory, 
therefore, a squirrel cage must be split up into (S$ + 1) circuits. 
It is convenient to make these circuits as nearly similar to one 
another as possible because this will lead to symmetrical relation- 
ships for the inductance and resistance parameters. One suitable 
choice of rotor circuit is shown in Fig. 1, where the squirrel cage 


Fig. 1.—Circuits of a squirrel-cage rotor. 


is represented as a pancake arrangement. In this Figure the 
first rotor circuit consists of the cage bars 1 and (k + 1), where 
k is any arbitrarily chosen integer, and the portions of the end- 
rings between these bars. In what follows, this rotor circuit is 
referred to as the rl rotor circuit. In a similar manner the th 
rotor circuit consists of the mth and the (7 -+ &)th bars and the 
portion of the end-rings between them. Proceeding thus we 
obtain the S rotor circuits, rl, r2...rS. The (S + 1)th rotor 
circuit may conveniently be chosen round any one of the two 
end-rings. This last circuit is referred to as the re circuit. The 
various rotor currents are correspondingly designated 


i . Ir1) in seks 
i,g and i,¢. 


(3) FLUX LINKAGES WITH CIRCUITS 
The net flux linking any circuit m may be written 


Di le oe 
n 
i 
where n ranges over all the stator and rotor circuits, and Linn | 


is the mutual inductance between circuits m and n. | 
For a squirrel-cage machine the various flux-linkage relations 


are 
For the stator 


s | 

0, = Lagla + IER | x Larnlyn ar Le orelre | 
S . 

®, od Lala i Lopip sn Lacie “i 2s Lornien ar Lbrelre 

nS | 


S 
O, == Leala a Ess IF Miah ate Xi Lernien a Lorelre 
fms | 
For the rotor 


Ss | 

®, = Dytala + Lpryip + Letcle + Xi Ln teniin + Lytrelre (2) | 
n= | 

j 
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S 
®,, ae Ly2ala i L,26lp ie Lrrcle ws 2, Eraenlen Gis L,2relre 
n= 


S 
®,5 = Lsaia + Lyspip + Lrscle + 2s Lrsinirn + Lrsrelre 
n= 
and 


S 
O,, = Neale ate y be sig Dede a Xu Lyernlyn ap Lrerelre 
n= 


The flux-linkage relationships for a machine with a phase- 
wound rotor are identical with those given above, except for the 
fact that ©,,, i,, and the inductance term associated with the 
re circuit do not exist. | 


(4) BASIC VOLTAGE EQUATIONS 


In normal operation the voltages applied to the rotor circuits 
are zero both for the phase-wound and the squirrel-cage 
machines. For impressed voltages v,, ¥, and v, on the stator 
phases of a three-phase machine the equations of voltages are 


d : 
ee ee che RG 
= dw : 
% = 7% + Re 2 3. chien 
d 
—— Hee + i.R, 


where R, = R, = R, = the resistance of a stator phase. 
The voltage equations for the rotor circuits are 


a S Ss ie a 
ah ae Tie breRrtre te x Reiralen 


d Same , 

Cfo) ae si lreR yore ee x Ryornlrn 
t= { 

(3b) 


d : S 
US = aes sf LoRpSre ae X Rr srnirn 


d S 
and re = are ala i Re iF Xi Rrnrelre 
n= 
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where R,,, is the resistance mutual to the circuits m andn. Of 
course, for the phase-wound rotor the last equation does not 
aold. Further, there is no mutual resistance between any two 
‘otor circuits, and hence only the R,,., terms will figure in the 
nbove equations. 


(5) BASIC TORQUE EQUATION 


The net torque developed by the machine can be readily 
dbtained from considerations of energy. In a motor, for 
example, the equation for torque becomes 


te 


(6) NATURE OF CIRCUIT RESISTANCES AND 
INDUCTANCES 
In a squirrel-cage rotor, as regards the stator there are only 
self-resistances, whilst as regards the rotor both self- and mutual- 
esistances exist. Because of the symmetry of the rotor circuits 
thosen, for the circuits rl, r2.. . nS the following relationships 
are satisfied by the mutual-resistance terms 


Kern am Keone @r(n+q) and Retr ae Rins+ 2) ae (5) 


wnere q is any integer. Also, the mutual resistance between the 
“e circuit and any one of the S circuits rl, r2.. . rS is the same 
“or all circuits and is henceforth designated R,,.. 

As regards inductances, it is apparent that if the angle between 
che axes of two adjacent rotor circuits is % electrical degrees, 
che mutual inductances between phase a on the stator and the 
various rotor circuits may be written 


Pe, cos 0 + >, L,,cos ke 
k=3,5, 


Log =Ly5 cos (8 + wb) + 2 Lyxs CoS KCB + x) 6 
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Lisa = Lys cos [8 +(S — 15] 2a Erks COS k(O + (S—1)) 


The mutual inductance between phases 5 and c and these roto? 
sircuits is obtained by substituting (@ — 120°) and (@ + 120°) 
“espectively for 6 in the above equations. For the simplifying 
assumption of sinusoidally distributed phase windings L,;,; = 0, 
k #1). Clearly, also, the mutual inductance between the rotor 
sircuit re and any one of the stator circuits is the same for all 
stator circuits, and hence 


ro pret I (Say) es. «1 D) 

Also, neglecting the influence of slots, 
jh = Ibs SIL. ee eer, va) 
whilst ep aL pod ae nba = Lege, COS. (271/38) oe, (9) 


where Ly»; CoS (27/3) is the component of mutual inductance due 
‘c the fundamental air-gap flux produced by a steady unit current 
‘hrough the actuating phase. 

Similarly, for the rotor, 


Lar L272 ; L,srs (10) 
1rd, because of the symmetry of the circuits chosen, 
Lamm = Lr(m+q)("+q) and Ly1(S+2)—4 = Lira (11) 
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where g is any integer. Also, the mutual inductance between 
any one of the rl, r2 . . . rS circuits and the re circuits is the 


same for all these circuits and may be taken as L,,.. 


(7) TRANSFORMATION OF EQUATIONS TO A REFERENCE 
FRAME FIXED TO THE STATOR 

In what follows we shall concern ourselves with the circuit 
theory of a squirrel-cage induction machine since this is more 
complex than for the wound-rotor type. The equations for a 
wound-rotor machine can always be obtained by neglecting the 
mutual-resistance terms and the quantities associated with the 
end-ring circuit. 


(7.1) Flux-Linkage Relationships 


Substituting the values of inductances as defined in Section 6 
in eqns. (2) we have the flux-linkage relationships as functions 
of 6. Considerable simplification can be effected in the form 
of these equations by means of a transformation which 
eliminates @. 

Such a transformation consists in defining new fluxes 0,,, D,g 
and ,, as 


D,., = ®,, cos 6+ O,, cos (6 + ys) + ©, cos (0+ 2p) +... 
+ O,s cos [8 + (S — 1x] 


,, =®,, sin 6 + ®,, sin(6 +4) + ®,;sin(9+2))+... 
+ ®,5 sin (6 + (S — 1)] 


©,,=®,, aS ®,2 ag ®,; +... + Ds 


(12) 


On carrying through this transformation and bearing in mind 
that Sy = 2pz, it will be found that 


Dia = FSi,L,, + FSi,L «COS 120° 
+ 4Si,L;5 COS 240° + tala + ieLyp 


OD, = 45Si,L,, sin 120° + 4Si.L,, sin 240° (13) 
= tales ip Lpa 
®,, = TLolro + Strole 
where 
1b, = Lain a ye cos ob a ae). + Tes cos (S— Ib 
Lp =L,1;2 sin & + L,1,38in 2 +... 
: (14) 
+ Lys sin (S — 1) 
Lyo = Lari ae Lira an L173 Rha Lrirs 
and 
ia = i,,cos@+i,,.cos(6+¢%) +... 
+ ig cos [8 + (S — 1)f] 
ig =i,,sin@ +i.sin(@+ +... (15) 


+ i,s sin [9 + (S — 1h] 


irs J 


Se ae re i ees 
Tro = Trt L2 T 13 fase 


Clearly L,g = 0, since Lytrq = Ly15s—g+2) [see eqn. (11)] and 
sings = — sin (S — q)b. 


(7.2) Voltage Equations 


The rotor voltages can similarly be transformed by means of 
the substitute voltages 0,2, Vg, Upo defined by 


Vr, = Y,1 COS O + VY, cos (9 + pf) +.-- 
+ v,5 cos [0 + (S — 1] 


= v,, sin? + »,.sin(@ + p+... 
+ v,5 sin [8 + (S — Dx] 


Vro = Uri + Up daa ek UPS 
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(16) 


On carrying out the necessary simplifications using eqn. (3) 
and bearing in mind the relationships between the various mutual 


resistances given in Section 6, it will be found that 


di, di, dic 4 »| 
0=Ya= bln) 4h Seer cos (27/3) 4 =F cos (4277/3) 
ding dig ; ; 
é | Rede. R,¢i-e 
+ Lye dt Lg dt rot! 
+ {4SL,5 [ip sin (27/3) + i, sin 47/3)] eae 
+ Lois — L,Blpo. dt 
i Hive | 
Se ene Bb bie te z 
0=%,2 = 15] 4 sin (27/3) + i iO (47/3) 
j di : 
as me ee an + Ryaira + Ryplre 


— {457 [ig + ig cos (27/3) + i, cos (427/3)] é 
+ Lyatpa, + Lie} 


| (17) 


di, di, : 
0 = % = Lio a ley bee ae Ryolro 
and 
(ihe , F 
0= Ure = rea dp ic) 
: di di 
an Lie ai Lreree =i Rrelre 
where 
Rr. = Rriri tp Rrira cos b Tees + Rrirs cos (S — 1)xs 
Rg = Ry sin & + R43 sin 2+... (18) 
+ Rips sin (S — 1) 
Ryo = Rint + Rrire apiece Rrirs 
Clearly 
R,g = 0, since Ryirg —~ IN ir(S+2—q) and sin gb = — SiS —qyp 


By the use of substitute rotor currents ig, i,g, i,o defined by 
eqn. (15) it will be found that the stator voltage equations also 
become functions of d@/dt instead of 0. Their form, however, 
can be made still more elegant by defining new stator voltages 


Usa, Usg, and U,, and new stator currents i,., is, i; by means of 


the equations 


Usa, = Vg + Vp COS (2717/3) + ¥, cos (4717/3) 
UsQ == Vp Sin (2277/3) + ¥, sin (47/3) 
Usq == Ug 1 Up V6 

and 


is, = Ig + iy CoS (27/3) + i, cos (47/3) 
i,g = Uy Sin (277/3) + i, sin (4277/3) 
lso =1g +h +i, 


| 


(19) 


(20) 


On carrying through these substitutions in the equations for 4 
stator and rotor voltages and simplifying, the following equations i 
are obtained: 


dig NB edi | 

Usa = Ralsa (ona Lap) ae a ars re \ 

| 

; dino mE aie tl 

i 

di, 32 Wad p 

Uso ae Ralso ot (Ea 2L ap) i | 5 Lsre i | 
1 di, 1 . do 

Org, = 7 5hrs ae ! 7 Lrsts8 7 ; 

: di, nal f 

+ Ryolpa 7 La St LraliB 2) 4 

i] 

_ do 1 di, 

v1.2 = — BSL; sisa 7 ae Shs 7 

. 40 ; di,g 

= Lyalra a ae RralyB + Lia | 

: di, di, 4 

Vro0 ae oko Nhe 7 } SLirea 

| 

di di , di 

re = Levee Si Lie ar Rrerelre AP Loree ; 


since, for a symmetrical machine, L,g = 0. These equations. 
constitute the basic voltage equations of the induction machine 
with squirrel-cage rotor. 


(7.3) Physical Interpretation of the Voltage Equations for a_ 
Symmetrical Machine : 


From eqns. (13), (19), (20) and (21), when simplified and 
reduced, it is seen that for a symmetrical machine / 


2 . 
Vsq, = Rolsa. + (Lea Jb a) diss ale Ce AD x, 


4L./ dt 207 dl 
Usp = Raise + (Lea — Lap ta ae si ) Bee 
22 
et ee ?_ 
oe =e SOs + R,ei,a 
From these equations it may be concluded that the machine ; 


may be replaced by a pair of coils along each of the «- and B-axes — 
(which are in quadrature) and acted upon by the direct- and _ 
quadrature-axis fluxes ®,,, D,, respectively. 


(7.4) Torque Equations 


From eqn. (4) the torque equation in terms of these substitute — 
variables can be readily obtained. With the necessary simplifica- 
tion the expression for torque reduces to 


c= $PL,s(iseipcr = iscelp) (23) | 


In order that this circuit theory may be of practical use it is 
essential that methods should be available for computing from — 
the dimensions of the machine the various circuit parameters 
that occur in these equations. Whilst there is a considerable 
amount of literature dealing with the evaluation of conventional 
design constants from machine dimensions, none exists wherein 


methods of evaluating circuit parameters of the type discussed 
; are presented. However, in what follows, many methods used 
(for evaluating conventional design constants are here adopted 
| for determining the circuit constants. 


((8) EVALUATION OF SELF- AND MUTUAL INDUCTANCES 
OF STATOR PHASES 
Both kinds of inductance associated with the stator phases 
| may be regarded as due to air-gap fluxes, slot fluxes and over- 
| hang fluxes. 


(8.1) Component of Inductance due to Air-Gap Fluxes 


To compute this component it is necessary to determine the 
| field distribution produced in the air-gap when a unit current is 
‘sent through any one stator phase. Obviously, this field distri- 
| bution will be profoundly influenced by the presence of slots 
‘and ducts and also by saturation in the iron. In order to 
‘simplify the problem we shall initially ignore the présence of 
‘slots and also the ampere-turns expended on the iron paths. 
‘On this basis it is now possible to associate a scalar magneto- 
‘ static potential (s.m.p.) with the stator and rotor surfaces as 
| discussed in Section 14.1. This results in a radial flux density 
at the stator surface given by By [eqns. (39) and (40)]. When 
‘the actuating winding carries unit current, the flux linking N 
‘turns, having k,,, and k,,; as the winding factors for the nth 

harmonic, is given by 


A+41 


Uo2l,N? ins, LEN7I, = ki. Kbns hy? Se hi? 
a hyByodO = yobs : ia? — va cos nA 
A-A40 


where A is the angle between the axes of the actuating phase and 
the coil in question. Hence, the self-inductance due to air-gap 
fluxes (A = 0) is 

Be Lhy > 


(“eno ins Hendin) = 0-2239H] (24) 
ne 3 


_gaair-gap) = w 


since, in practice, the air-gap length g <f,. In a similar 
manner the mutual inductance between phases a and b in a 
machine is given by 


[o4N21h5 x hee ons 
TPS piles} rn? 
[= — 0-1069H for A = 27/3] 


_apair-gap) = cosmA . (25) 


(8.1.1) Correction for Slots and Finite Permeability. 


In order to allow for the presence of slots the gap length 
may be regarded as being k,g. Despite this correction the 
inductance values are bound to be too high because of the 
assumption that the iron is of infinite permeability. The s.m.p. 
difference actually operating across the air-gap at any point on 
the stator surface is only a part of that calculated on the basis of 
infinite permeability. It is clear that this ratio will vary from 
point to point along the surface because of the variation in flux 
densities. A strict allowance for this effect is therefore out of 
the question, and recourse must be had to approximate methods. 
In what follows a ‘permeability factor’ K,, defined by the expres- 
sion K, X (inductance computed on basis of infinite per- 
meability) = (actual inductance), is used. This factor may 
tnerefore be regarded as averaging out the effects of saturation. 
in the case of balanced 3-phase supplies, to compute for K, 
determine (a) the flux per pole, (6) the areas and lengths of the 
‘nagnetic circuit, taking the length of path in the core as 2/3 of 
‘ne pole pitch, (c) the flux density in each path at 60 electrical 
Aegrees from zero point, and (d) the ampere-turns for each path 
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at 60°. The ratio of gap ampere-turns to the total ampere-turns 
is then reckoned as the saturation factor. 


(8.2) Component of Inductance due to Slot Fluxes 


Assuming that the flux passes straight across the slot, the self- 
inductance per slot for a slot shown in Fig. 2 and carrying 
current in the same direction in both layers? is 


eee _ 8dy 4d,+dy | ad 


key (slot flux) = 
ie ; 4 Wee ww) Wee 3W 


[=2945 x 10-7 H] 


where n, = Number of conductors per slot [38]. 


pemtene 


ajuk « 
+ 4EIR 
Pha CA 


Fig. 2.—Stator slot dimensions. 


w, = 0:3175cm. dz = 0:0508cm, 
w2 = 1:113cm. dz = 0:°381cm. 
do =0:0508cm. ds =0°3175cm, 
dy = 1-956.cm, ds = 0. 


Similarly, the self-inductance per slot for the bottom layer 
alone carrying current is 


| Be the Lonel, ds 2d4 es 2(d; —< dy) + 3d, 
‘aa (slot flux) 4 (wy, at W>) 3W> 
[=1037 x 10-7 H] 


Also the self-inductance per slot for the top layer alone carrying 
current is 


mw = Lonsl.| ds 2d4 d; , (d, — a) 
Liga (slot flux) A 12 os Gene) a3 a, 
[=576:5 x10" 78] 
Thus, 


Laa(slot flux) — Liat sig E05 ot Liyi83 [= 0-002 47 H] 


where 6,, 55, and 63, denote the number of slots pertaining to 


the respective cases. 
Proceeding in a similar manner, the mutual inductance per 


slot in top and bottom layers is 
, _ __ Kotsle| i — a) | as 2d4 a 
Lga(slot flux) 4 il 4w, ae - ar (wW, ar W) ar 7 
[= 6) 534s 0nd 


Hence 
LobGiot ind = Cel ah ortnn |= O° 23. KOS ara) 


where 6,4 denotes the number of slots to be considered [8]. 


(8.3) Component of Inductance due to Overhang Fluxes 


A rigorous analytical computation for various types of end- 
winding configurations is practically impossible. In what 
follows we shall attempt to deduce expressions for the self- and 
mutual inductances due to overhang fluxes for machines with 
diamond windings using the formula deduced by Alger.* 
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Alger’s analysis is for balanced 3-phase operation and is based 
on a resolution of the overhang currents into an axial system 
and a peripheral system of currents. The expression given for 
the end-leakage reactance due to axial currents is 


789 x 10-8mfD,N7k},, tan a (ym — sin vir 
AGy 9 p ( 7 


[=0-5489 ohm] (27) 


where m = Number of stator phases [3]. 
v = Ratio of coil span to pole pitch [7/9]. 
D, = Mean diameter of the overhang [0:222 m]. 
f = Supply frequency [50c/s]. 
: w+ec 
sino ag 
where w = Slot width [0-011 1 m]. 
t = Tooth width [0:0064 m]. 
c = Coil end clearance [0-002 4 m]. 


[=17/22]. 


The end-leakage reactance due to peripheral currents is similarly 
given thus 


319-7 x 10-8mf N2k2, 5k?1, 
é 


Xx, 


= <= [ D, log (4D,/r;) —1-75D,] 


[=0-7017 ohm] (28) 


where r = Half depth of stator slot, [0-012 7m]. 

The net reactance in a 3-phase machine is the sum of X,, 
and X,,, and the effective inductance per phase is obtained 
by dividing this reactance by 27f. For computing the self- 
inductance due to overhang fluxes we multiply the effective induc- 
tance by 2/3, whilst the mutual inductance is taken as —1/3 of the 
effective inductance. This type of division is adopted because 
it is the ratio in which the inductances due to the fundamental 
component of air-gap fluxes divide themselves. Thus, 


i 2 : 
Tentecwes = Sap 3 Kea + X,.), [=2:66 x 10-3 H] 


and 
)hegip (overhang) — —_ (1 /67f)(Xeq Ie Xe), [= —1-34x 10g H] 


Summarizing, the total self- and mutual inductances of the stator 
phase are respectively given by 


K, 
ibn = Fp Laa(air-zap) =F Line (slot flux) =P La (overhang) (29) 
g 
[=0-1788 H for K, = 0-93, k, = 1-2] 


K 
and lb = 5p Lab (air-gap) aig Lab (slot flux) aR Lap (overhang) (30) 
cs: 
[=0-0847 for K,, = 0-93, k, = 1-2] 


(9) EVALUATION OF SELF- AND MUTUAL INDUCTANCES 
OF ROTOR CIRCUITS 
Here also the computation of the inductances depends on 
air-gap fluxes, slot fluxes and overhang fluxes. 


(9.1) Component of Inductance due to Air-Gap Fluxes 


In the case of squirrel-cage rotors, to compute the inductances 
between the rl and any other circuit, say the rn, we proceed as 
before by initially neglecting this influence of slots and satura- 
tion, and defining a s.m.p. along the stator and rotor surfaces 
as shown in Section 14.2. The flux linkage per ampere with the 
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rn circuit (comprised of cage bars n and n + k and the portion 
of the end-rings) is then, from eqn. (40), 


(2(n—1)/S]+(4/S) 2 
kl ma 
KoW'e Byyhyd0 = Dy ap COS a ae 
S m=1,2,3 S 
2(n—1)/S]—(k/S) 


[ 
where 


A ( 4 ) 


; _,mkn poh... 5 ae 
Lr =Lole (Pa RR) sin sin S 


S meer 
: (31) 


Thus the component of inductance due to air-gap fluxes between 
the rl and rn circuits is L;,, cos [(n — 1)2m7/S]. These values 
have to be corrected using the factors K,, and k,, as in the other 
cases. Furthermore, the effects of skew have to be taken into 
account. It is shown in Section 14.3 that each harmonic com- 
ponent of any inductance between rl, r2 . . . rn circuits is multi- 
plied by the square of the skew factor k?,,, for that harmonic, 
whilst any inductance between the stator circuit and one of 
rl, 12... « mm circuits is, multiplied by .4,,,,, where va. 
= sin (mx/p)](x/p) and x/p = angle of skew [10° or 0-174 5 rad]. 
Thus the component of inductance due to air-gap fluxes between 
the rl and rn circuits is 


MIT 


K mn — 1)2m 
Lytrn(air-gap) = — ees BO AES 
g m=1,2,3 


where sl), pete hee 


[Li = 7:2 H; Ly, = 3°5 H; Ls, = 0:74 H; Ly, = 0; : 
Ls, = 0°224H; Lg = 0-28 H] | 


(32) 


(9.2) Component of Inductance due to Slot Fluxes 


In the case of bars of circular section, assuming the flux to 
pass straight across the slot, the inductance is known to be 


Lys = 0-623. [= 1 x 10-7H] 


As regards mutual inductance, clearly all circuits, other than 
r(k + 1) and r¢S + 1 — k) circuits, have no mutual inductances 
due to slot fluxes, whilst these circuits have a mutual inductance 
with the rl circuit equal to the inductance of one slot. 


(9.3) Component of Inductance due to End-Ring of a 
Squirrel-Cage Rotor 


Note that the rotor end may be taken to be a flux line. On 
this basis, using the method of magnetic images applicable to 
iron of infinite permeability, the field linkages when a unit current 
is sent through the end-rings are equal to those produced with 
one ring. When two equal coaxial rings are spaced apart by a 
distance equal to twice that by which an end-ring overhangs the 
rotor, both rings carry equal currents in opposite directions. 
The inductance of one end-ring is then clearly the self-inductance 
of one such ring minus the mutual inductance between the 
two rings. The Rayleigh-Niven formula for the self-induc- 
tance of a single-turn coil of circular cross-section of diameter 
D [=15-1 x 10-*m] and radius of cross-section r [=0-8 
< 10571 )|jis 


2 4D r2 
L = 2nD| (1 a) hi) a skis ae “7 
nD| “e log tre 1-75] x 1077, 
[= 2-43 x 10-7H] 


whilst the Maxwell formula for the mutual inductance between 
two similar coaxial coils is 


3d2 2D d2 
Nie 2nd | (1 ms a) log — (2 ae a] x 10-7, 
[=1-573 x 10-7H] 


‘where d is the projection of the end-ring over the iron 
[0-813 x 10-?m]. The self-inductance of the end-ring is there- 


fore L — M. It must be noted that r is the effective radius of 
the section of the end-ring and may be taken as 4/ (a/7), where 
a is the area of section. The quantity Leo appearing in the 
}} computation is taken as the inductance associated with a portion 
of the end-ring between two adjacent bars. Taking both rings 
into account, the approximate equation is 
Lg = 2(L — M)/S, [= 3-884 x 10-°H] . (33) 
Summarizing, 
Koo ) 
Ltr = 21 ps 53 Ld bry =I pe »2 Tones 
g m=1,2 
[= (K,,101-2 + 2-42) x 1077] 
K, og 2m 
LAr =0+ (k 7 1) Ss ar is > one COs ere 
g m=1,2 Pp 
: Y Grapegece re) 
L173 =0 (k, 2)Leo a3, Linr COs uae 
ky Tae 
fel gf) hes (k — 1)2mps 
Lr trk 0 + Leo 1 ie 2 Line SS 
g m=1,2 
K, k2m 
Lrirnk+1) == (Lg, 1 Olas ie S sb COs u 
g m=1,2 Pp 34) 
( 
KG ea 2m 
Leirck+2) =0+0 tee >> Line COS (k pa eee 
g m=1,2 


Kye S—k)2m 
Lirns—k+1) = —44s +0 aie _ spr cos Sai 
g ae iy 
oe) 


K 
Ly ir(S—k-+2) = Ono. — 


(S —k + 1)2nefs 
D 


Lt 7§—1) =0 Se (k oo 2) Tore 
Ki — 
me (S — S + 2)2meb 
Pp 


Lyjrs =0 + (k — 1)Le6 


Ku om } 
aes Srareecos (S — S + 1)2mis 
k, m=1,2 D J 


(9.4) The Inductance, L,., 


In actual computation, the value that sufficiently represents 
the rotor inductances in a symmetrical machine is the quantity 
Lira, given by 


Dra = Lyn + Lan cose +... + L1,5 cos (S — 1x 


From eqn. (34) it is clear that the part of L,, due to slot 
fluxes alone is 


Lyx, (slot fluxes) = 2Ly5(1 — COs kb) 
since Sys = pz, p being an even number. 
Also, the part of L,, due to overhang fluxes is 


Lore. (overhang) = KLeg + 2Leo[(k — 1) cos + (Kk —2) cos 2b +... 
+ (k —k + 1) cos #] 
(1 — cos ke) 
~ (1 —cos i) 
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Next, to compute the part of L,, due to air- gap fluxes alone, 
we consider separately the cases when S§ is odd and even. 


(a) For S odd.—Here the part of L,, due to air-gap fluxes is 


Ky, US—1) 7, © 2mn 
a fl § XL +2 3 (EL 005 # 60 nb) | 
glLm= n=1,2 \m=1,2 Pp 
K 
Cg | m=(qS—4p) m=qS+4p 
qa=1,2 g=O0nle2 


Substituting for L,,,, we find that the above expression simplifies to 


Lire (air-gap) 


Ke 
“2 va — cos ki) SL iro 


rs(S—4p) , k.005 4p) a 


where L,,. =—(Lh;I2) 
= dil) (ya QS — ip? 


kp rsp) 
(Sp)? 


k2 k2 
vale rs(S— 4p) ne rs(2S+4p) | 
GS 3p)? “(2Se tp) 


[= 0-1039 x 10=° HI] 
(6) For S even. 


It can be shown that for this case 


K 
Dra (ait-gap) = Te (1 — cos kef\(S + 2)Lyr0 
g 


Thus 
Lye as Dire (air-gap) oP J bee (slot flux) a Le (overhang) 
Leo 


25) Seven 


Ky 

— (1 — cos kx) ic ae 2) ere ae PM bee ae 
g 

(35) 


25} Sodd (36) 
— cos fs 


= (1 —cos ky | —" SLiro + 2Lg5 + ; 


[= (K,953-1 + 2-95) x 10-7H] 


(9.5) The Inductance Lrg 


As shown in Section 7.1 for a symmetrical machine, L-, = 0. 


(10) MUTUAL INDUCTANCE BETWEEN CIRCUITS ON 
THE STATOR AND CIRCUITS ON THE ROTOR 


(10.1) Inductance due to Air-Gap Fluxes 


To compute the mutual inductance between a stator phase 
and any one of the rotor circuits rl, r2. . . rS, we note that a 
current through phase a results in a scalar potential distribution 
throughout the air-gap, and the radial flux density, B,,, at the 
rotor surface (r = h,) is given in Section 14.1. A full-pitched 
coil on the rotor with its axis making an angle « electrical 
degrees has flux linkages given by 


(n+2a)/p 
1/2np},1/2np 
Loh i ei 2[oh 1/ 1 hy h 
WeVee Bd) = ——— 2 aS 1)12@—-D4q ns Te — iP COs Na 
(—n-+20)/p 


Hence, for a distributed winding having N, turns per phase on 
the rotor, with pitch and distribution factor for the nth harmonic 
given by Kyn-kpnr, the mutual flux linkages are 


2h; ii 5 ee h )1/2np 
LoN Konr Kone(— ye? . —1)1/2@-1) — 2 


ne — pip — COS Na 
1 
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where a, is computed for a current of 1 amp through the stator 
phase. 
For the case of squirrel-cage rotors, the flux linkage per unit 
current with a circuit « electrical degrees away from the rotor is 
(ky +2a)/p 
Dart — “ik | Byyh,d 


(—ky +2a)/p 


SIVA ZN et epeN aa eee ‘ 
= Kyle gp3 _) A | PASE) baad sin 1a | cos g@ 


co K,, 
= » : je Ema cos gO, say. 
a= g 


[= K,,(0-8968 x 10-3 cos 8 — 0:03215 x 10~4 cos 38) H] 
(37) 
where k;,, = kysm for m = 4qp and is the skew factor for the 


rotor bar (see Section 14.3). The above expression is corrected 
for permeability and air-gap extension. 


(10.2) Component of Inductance due to Overhung Fluxes 


Following Alger, the mutual reactance due to axial currents 
may be taken, for the case of balanced polyphase supplies, as 
0-8D, 
(5, 
whilst the mutual reactance due to peripheral currents is 


1/2p 
) x (Self-reactance due to axial current in the stator), 


4D 
D log, — — 2D 
R eros per phase 
due to axial currents 


) G8) 


D, loge 2 — 1-75D; 
1 
The mutual inductance between stator and rotor during balanced 
operation due to axial and peripheral currents may be obtained 
from the above expression on dividing by 27f. It is suggested 
that these inductances multiplied by (2/3)(2/S) may be regarded 
as the maximum value of the mutual inductance between the 
stator and rotor circuits, the inductance being assumed to vary 
as a sinusoidal function of the rotor position. 
In the above expressions 


D = (D,D,)'? 


where D, is the mean diameter of the stator overhang, and 
D, that of the rotor overhang. 


R = [4D — Dz? + (%) — YP]! 


vaD v2 

here Y= “71(1 =) 

where Y, &p aE 5 tan « 
7D 2 

and Y, = aba “(1 +) tan «, 
4p 5 


V, V,, &, &,, being as defined in Section 8.3. 

In the case of a squirrel-cage rotor it is also necessary to 
compute the mutual inductance between the stator overhang 
and the rotor end-ring. As shown by Alger, this may be taken 
as two-thirds of the mutual inductance between stator and rotor 
circuits due to peripheral and axial currents. 


(11) APPLICATION OF THE THEORY AND ITS 
EXPERIMENTAL VERIFICATION 
The circuit constants mentioned above were estimated for 


an experimental squirrel-cage induction motor having the follow- 
ing specification: 


440 volts, 14h.p., 4 pole, 3-phase, SOc/s delta connected, 


3 slots/pole/phase, double-layer winding with 38 conductors per — 


slot and chorded by 2 slots. The dimensions of parts of the 
machine and the computed values have been given in brackets 
in the preceding Sections (see List of Principal Symbols). 

The permeability factor, K,,, was estimated for various voltages 
as outlined in Section 8.1.1; the values are 


Voltage .. .. 50-250V 320V 440V 
Ky, or ca 0:93 OFS3emOCOr, 


Several of the estimated inductances were compared with 
values obtained experimentally. At first the cage was removed 
and replaced by a few circuits wound on the rotor. The stator 
self- and mutual inductances were obtained in the usual manner. 
The mutual inductance between a stator phase and a rotor circuit 
was computed for various positions of the rotor and a curve of 
inductance as a function of position was plotted. The funda- 
mental component of the curve is taken to represent L,,, cos @. 
Next, to determine the inductance between rotor circuits, a 
known current was passed through the rl circuit and the voltages 
induced in others were measured. This voltage divided by the 
product of the number of turns in each of the circuits in question 
gives the reactance operation for a single turn in each. To 
measure the self-inductance of a rotor circuit it was found 
advisable to wind an additional coil following the actuating coil. 

Table 1 shows a comparison between the experimental and 


Table 1 


COMPARISON OF CALCULATED AND EXPERIMENTAL WALUES 
FOR THE CIRCUIT PARAMETERS 


Calculated Experimental 
Laa, henry 788 0-184 
Lab, henry 0:0847 0-0875 
(Kp. = 0-93) 
Lari, henry 0-834 x 10-3 0-853 x 10-3 
(Kyu = 0:93) 
Lytri, henry 1-456 x 10-5 eo S< iOes: 
Xm, ohms 83-93 86 
(Ku. = 0-93) 
Isc, amperes 67 71 
(Ku = 0-67) 
Io, amperes 7-0 Tes 
(Ky = 0-93) 
Zsc, ohms 2:95 + j7:26 
(Ky, = 0:67) 


computed values. The computation of the short-circuit and 
no-load currents is based on eqn. (21). It is claimed that the 
agreement is within about 4%. 
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(14) APPENDICES 


(| (14.1) Field Distribution in the Air-Gap due to a Current through 
a Distributed Winding on either the Stator or the 
Rotor 


For a pole pitch 7, the s.m.p. distribution due to a single full- 
| pitched coil can be expressed in the Fourier form as 


= F (-1)!'2e- Dz os (“) 
n=1,3 
For a double-layer winding, in general, by superposition 
Aj 
a ( ICD Kegan cos (nx ]27) 
a 


Thus the s.m.p. on the stator surface (r = hy) is of the form 


Giz = Di ans Cos (npO/2) 
n=1,3 


‘whilst, by symmetry, the potential of the rotor surface may be 
| treated as zero. The scalar potential at any point (r, @) in the 
| air-gap must satisfy the Laplace equation in the polar form. The 
: solution for this field is found to be 


Pp 
G,.9 = pais @ns_(p1/2np — hiPr—112"P) cos 4npO 
: (39) 
The radial flux density at any point (r, 0) is given by 
dG, 
Byres im Ho (40) 


whence By. = (B,)r=n, and By, = (B,)r=n, 

In the case of a current through a distributed winding on the 
rotor, the s.m.p. can be defined for the rotor surface in a similar 
manner and the flux density can be computed. On this basis 
the flux densities at the rotor and stator surfaces are given by 


: aa np 
Bry = Po > 0 ap ph SP hiP ) cos = 
9 (41) 
land By = YD ee Sn [h} 2reAs/2” 0] cos 


(14.2) Field Distribution in the Air-Gap of a Cylindrical-Rotor 
Machine due to a Single-Turn Winding on the Rotor 


For computing self- and mutual inductances between rotor 
circuits and between rotor and stator circuits it becomes necessary 
to find the field distribution when a current is sent through a 
circuit such as the rl. By arguments similar to that used earlier, 
a zero s.m.p. being assigned to the stator surface, the rotor surface 
can be regarded as having a potential distribution similar to that 
shown in Fig. 3, where 6’ is measured in mechanical degrees and 
#ne coil sides are at PP’, the origin being taken at the centre of 
the coil. The coil spans an angle equal to k rotor-slot pitches, 
namely 2k7/S. 
This potential distribution, in the Fourier form, becomes 


= ) a,,cos mi’ 
n=1,2,3 


where a,,, = (2i,4/7m) sin (mk7]S), since Sis = pa 


{ 
aoe 


Fig. 3.—Scalar-magnetostatic-potential distribution due to a current in 
the rl circuit of the rotor. 


Solving the Laplace equation for the air-gap, 


m 
‘éE eS hi Qin 
ry 2 
m=+1,2 (ng or hy”) 


At the rotor surface the flux density is given by 


(hg™r-™ — r™) cos mb’ 


MQ pr, res ie , 
By = Si pg am 0B"! + HE"-P] cos m6 (42) 
m= 2 1 


(14.3) Influence of Skew on the Inductances 


Consider a squirrel-cage rotor with bars skewed through an 
angle of x electrical degrees. The average flux density along a 
line parallel to the axis on the rotor surface may then be deduced 
from the following considerations. Let this line make an angle 
0’ with the axis of the rl circuit, the term ‘axis’ meaning a straight 
line on the rotor surface parallel to the axis of the shaft and 
passing through the mid-point of the circuit. Then the flux 
density along this line due to a circuit with unskewed bars 
having its axis at an angle @ mechanical degrees away is 
ma 


tn [mhz | + AY") cos mB’ — 8) 


By = Po » 2 (3 — hi”) 


The average flux density along this line due to the skewed coil 
may be regarded as the average flux density due to a uniformly 
distributed winding each side of which extends over x electrical 
degrees. This average flux density at 6” is 


x/p 
P = MA Le 2 x : 
By d= > OF [Ah + h2"—Y]k sq Cos mO 
—x/p ; 


where k;s5,, = sin (mx/p)/(mx/p), the skew factor for the mth 
harmonic. 


The flux linkage with the rn circuit, assuming it to consist of 
parallel sides, is the integral of the above over the range 
— 2n — 1)r7 —k 
2(n 2 + kr and (n 7 
43 + 13) Kn Hee —1)7 Be mka 
Ho 2m 2m Os ee ) 
paid, Ui ae net 


This yields 


AR COS ee (say) 


The flux linkage with a circuit, per metre length of rotor, with 
conductors parallel to the rotor axis, and at an angle 6 away 
from the above is 


co ay 
> Amr COS m |e ky + | 
m=1,2 S 


Thus the flux linkage with the skewed circuit, which may be 
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regarded as equivalent to a single turn with each side distributed ein = : ; F 
over a geometrical angle of (2x/p) is Wihehemm ioe aes ( 1 — cos , on simplification. 
x|p nae uae 
PD co By arguments similar to the above it is clear that the average 
x Dy Aye COs pes lee eA gap flux-density distribution of the gth harmonic linking with a 
es stator phase is now merely multiplied by the skew factor for 


sal og “te = Da l)a (43) the gth harmonic, yielding eqn. (37) for the inductance between 
pe stator and rotor circuits. 
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SUMMARY 


If a voltage z(t) is applied to a network of admittance* Y, the 
resultant current I(t) can be obtained by three different methods: 

(a) By symbolic calculus, in which an image or transform V of v(t) 
‘is found and I(t) is determined by applying the inverse transform to 
the product VY: this leads to difficulties in the f.m. case because V 
pcannot be conveniently formulated. 
(b) By the convolution integral derived from v(t) and the response 
) y(t) of the admittance Y to unit impulse. Well-known formulae are 
‘here derived by this method, together with an additional formula 
napplicable to a typical amplifier-discriminator system in which the 
‘distortion imposed by the admittance is separated from mere delay. 
(c) By operational calculus, in which the admittance Y is regarded 
jas an operator, a function of d/dt, and the input voltage v(t) is the 
coperand. The same formulae are here derived more simply by 
‘@nerational calculus. 


(1) INTRODUCTION 


If a voltage v(t) is applied to an admittance Y, the formal 
solution in terms of symbolic calculus (or the Fourier integral) 
‘amounts in essence to finding first the image (or the spectrum) 
‘of v(t). For a f.m. signal, however, this image (or spectrum) 
‘consists of a very complicated series of sidebands involving 
| Bessel-function coefficients. The analysis does not yield a simple 
‘approximation like the ‘quasistationary’ solution derived by 
‘van der Pol,! and will not be discussed further. 

An alternative approach, discussed in Section 2, is to deter- 
‘mine the response y(t) of the admittance Y to unit-impulse 
input. The resulting current J(t) is then the convolution integral 
of v(t) and y(t). This approach leads easily to the quasistationary 
‘solution already mentioned. The weakness of this solution is 
that it does not in itself distinguish distortion from mere delay; 
a way of allowing for the delay content of the admittance Y is 
proposed in Section 3, and applied to a typical amplifier and 
discriminator. 

A third possible method of attack is to regard Y as an operator, 
a function of d/dt, and v(t) as the operand. Operational calculus, 
as opposed to symbolic calculus, is somewhat out of fashion at 
present, and the writers believe that this eclipse is not justified.? 
It is used here, in Section 4, to derive all the results obtained in 
Sections 2 and 3 more simply. Questions of convergence are 
best considered in terms of operational calculus, and are briefly 
discussed. 


(2) FORMULATING THE PROBLEM: EVALUATING THE 
RESPONSE IN TERMS OF THAT OF THE NETWORK TO 


UNIT STEP 
We shall take the input signal to be 
v(t) a= eloot F(t) _ eloottivfe@dt =s eJoot + jst) : (1) 


where the carrier frequency w/27 will be taken as 10-7 Mc/s 
is numerical calculations while the maximum frequency devia- 


* Admittance may be generalized to transfer function. 


Correspondence on Monographs is invited for consideration with a view to 


p blication. . e ; d 
Mr. Head and Mr. Mayo are with the British Broadcasting Corporation. 


tion /27 will be taken as 75kc/s, so that o (=4/w 9) is a small 
quantity of order 0-007. In eqn. (1) |g(4)| < 1; g(t) depends 
upon the programme being transmitted, and we may therefore 
have available only statistical data. We do know, however, 
that the programme is unlikely to contain any appreciable com- 
ponents at frequencies higher than 10kc/s. This means that the 
maximum numerical value of g’(t) is small compared to pw; the 
maximum numerical value of g’(t) is small compared to p, 
and so on. 

Now suppose that the response of the network of admittance 
Y to unit-impulse input is y(t); we shall consider later the form 
of Y likely to occur in practice. The current /(t) in the network 
when any signal v(t) is applied is given by 


Oke i Ree ews ee ee 
0 


The convolution integral, eqn. (2), can be derived from ele- 
mentary considerations in the time world alone which do not 
require the use of symbolic or operational calculus. We are 
not interested in the transients caused by starting the signal v(f) 
at zero time; we prefer to consider v(f) as a stationary time series. 
As the impedance Y necessarily has a dissipative character, any 
transients due to switching on the signal will have disappeared 
if ¢ is regarded as sufficiently large. The simplest way to allow 
for this is to replace the upper limit of integration in eqn. (2) 
by ©0, and to ignore any finite change in the lower limit. Thus 
the modified version of eqn. (2), to allow for the fact that we 
consider v(t) as a stationary time series and are not interested in 
transients associated with a particular time of switching-on, is 


KG i w(t — A)y(AjdA £0 SRE) 
0 


We regard eqn. (3) as our basic starting-point, from which all 
the results we require will be derived. 

A general formula can be obtained by expanding v(t — A) asa 
Taylor series in powers of A. We shall assume that the series is 
convergent. Conditions of convergence are clearer in the 
operational form of solution discussed later. Substitution of the 
Taylor series in eqn. (3) gives 


eat d’ 
mo =| =, {pony an Le 


We also assume that we can reverse the order of the limiting 
processes in eqn. (4), and that we can integrate term by term, 
so that 


2A CLIO EY 
NOES Dy rere J yay — Ayan a 
Now consider the integral 
Y¥,(p) =| ePQN-AA ss © 
0 
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which is somewhat more general than that associated with a 
typical term of the summation eqn. (5), but reduces to the 
integral involved in eqn. (5) if p is zero; in eqn. (6) p is any 
number independent of A. We already know that Y can if we 
wish be expressed as a function of p in the form 


28) 
Y = | yA)enPdd (7) 
0 
since y(t) is the response of the admittance Y to unit impulse. 
If we now differentiate eqn. (7) with respect to p, the only term 
involving p on the right-hand side of eqn. (7) is €P, so that the 
differentiation under the integral sign merely introduces an extra 
factor (—A); the limits are independent of p. Thus 


GY = | yO(— Near (8) 
dp 0 
and repeating this process gives 
EX = [ 0)(— dread 0) 
dp 0 
so that, substituting back in eqn. (5) and letting p tend to zero, 
we have 
See es UIA) d'v(t) 
= 10 
Ae 2 ae ie dt" (9) 


Eqn. (10) is a perfectly general result, whatever the nature of the 
input voltage v(t) and the network admittance Y. We have in 
effect only assumed that we knew the formula giving the impe- 
dance of a network in terms of its response to unit impulse and 
vice versa. This does not require us to speculate on the precise 
meaning and significance of p, except in so far as questions of 
convergence are concerned. It is, however, easier to postpone 
consideration of convergence until we consider the operational 
method of attack in Section 3. 
If we suppose that 


v(t) = e/** y(t) (11) 
I(t) in eqn. (3) becomes 
(= eit | eFy(t — d)y(A)dr (12) 
0 


If w(t) is now expanded in a Taylor series instead of v(t), we can 
reduce eqn. (12) with the aid of eqn. (9) to 


ro ow § ero] a 
r=0r!L dp" |p, dt” 
Egn. (13) is a well-known result, given, for example, by van der 
Pol! [his eqn. (34)]. It is applicable to all types of modulation. 

For the particular case of frequency modulation, where v(A) 
is v(t) given by eqn. (1), it is convenient in eqn. (12) to take 


w(t) = 6 


(13) 


k = wy 


(14) 


Alternatively, we can in this case expand s(t — A) by Taylor’s 
theorem instead of w(t — A). This gives 


ice) 
I(t) = fet I E Jody) drgisO As‘) +4)228'"()—. ne 
) 


ioe) 
= ghoot +s | EINoot sO (V)drgtis/O+ ++ (15) 
0 
If in eqn. (15) we replace the last exponential by unity, we obtain 
the quasistationary approximation 


I(t) = Y(p)eioottisO , (16) 
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where p is replaced by j[w9 + s‘(4)], that is, p/27/ is the ‘instan-— 


taneous frequency’. This result was also obtained by van. der 
Pol. The first terms neglected by the quasistationary approxima- 
ation are those in which the last exponential factor of eqn. (15) 


is replaced by 
A pr2s(t) + 4039’) (17) 


and, when the terms in eqn. (17) are allowed for in eqn. (15), 
eqn. (16) becomes 


1) =[¥@) + HY" Os" +BY Ds" +. Jelont (18) 


where p is replaced by j[wo + s’(4)] after differentiation. Now 
I(t) in eqn. (18) differs from a constant multiple of v,(4) but the 
difference is not necessarily all distortion. 
was a constant multiple of x(t — 7), where 7 was constant, the 
response I(t) would be merely delayed by a time 7. This is not 
distortion. 
separate the distortion from the delay, and this involves analysis 
of Y(p). For the type of admittance likely to occur in practice, 
however, it appears to be advantageous to replace the quasi- 


stationary solution by a different type of solution in which a 
suitable delay factor is separated out from Y(p) at an earlier 


stage. 

It will be helpful, however, first to consider the question of 
frequency changing. We have seen that if the signal represented 
by eqn. (11) is applied to the network of admittance Y, the 
response I(t) is given by eqn. (12). Now the envelope J)(t) of 
the response is I(f)e~/**, so that 


I(t) = | ew(t — NyQAA (19) 
0 


Now suppose the frequency is changed by K/27 cycles per second, 


so that the applied voltage, instead of eqn. (11), is 


Ot) = JE&+My (1) . 


and that the impedance Y is replaced by Y, in such a way that 


the envelope is unaltered. The resulting current is 


I(t) = eik+Kt | eIE+ Kt — Ny, AGA... 2D 
0 

where y(t) is the response of the impedance Yj to unit impulse. 

The envelope J)(f) is obtained by omitting the factor e/*+) from 

eqn. (21). If Jo(t) = Jp(t), we must have 


If, for example, I(t) — 


Further examination of eqn. (18) is necessary to © 


(20) | 


eFRy,(A) = yA) (22) 
Combining eqns. (7) and (22), we deduce 
Y,(~) = Y~+JK) . (23) — 


Now the response of a band-pass filter is generally expressed, 


at least approximately, in terms of the deviation of the frequency 
from the mid-band frequency, and eqn. (23) simply means that 
the effect of frequency changing is to replace one band-pass 
filter by another of equal bandwidth etc., only the mid-band 
frequency being altered. This changed mid-band frequency can 
be reduced to zero; the band-pass structure then becomes the 
equivalent low-pass structure. This happens if the admittance 


Y is a band-pass filter having mid-band frequency Wo/27m and | 


K is taken as wo. We then know that Y can be expressed as a 
function of (p — jw), say 


Y = Y,(p — jw) (24) 


so that after frequency-changing which reduces the mid-band 
frequency to zero, the effective impedance becomes Y,(p). 
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In practice Y is not strictly a function of (p — jw) but rather 
of (p/w + wo/p).* If we consider a f.m. signal of the form 
of eqn. (1), however, and write 


P=jay tx) (25) 


then x cannot numerically exceed the small quantity lwo = o, 
which is of order 0-007 if the maximum frequency deviation 
Hl2m is 75kc/s and wo/27 is 10-7Mc/s. Substituting from 
eqn. (25) and expanding in powers of x, we have 


Eees 


so the error in neglecting all but the first term of eqn. (26) is of 
the order of $c or 0:35%. If this error is neglected, eqn. (26) 
means that Y can be regarded as a function of jx or (p — javp)/a9, 
in such a way that the original band-pass admittance is replaced 
by the equivalent low-pass admittance for the signal exp [is(2)]. 
Thus what we really require to calculate is 


P , &% 


— = 2jx|1 —4x+4% 26 
Wo D ix 2 2 (26) 


ao 
H(t) = | e-DFA)AN (27) 
0 
where f(1) is the response to unit impulse, not of the original 
Dand-pass structure, but of its low-pass equivalent. This low- 
pass equivalent is obtained by expressing the band-pass admit- 
fance as a function of deviation x from the mid-band frequency, 


and then replacing x by p/jw , and the error committed is of 
order 0:35 %. 


(3) EXTRACTION OF A SUITABLE DELAY FACTOR 
FROM THE EQUIVALENT LOW-PASS ADMITTANCE 
We have seen that in order to calculate the effect of the input 
f.m. signal [eqn. (1)] on the network whose admittance is Y, 
we can use eqn. (27), i.e. we regard Y as a band-pass admit- 
tance and replace it by the equivalent low-pass structure 
whose admittance we shall call y(p). The input signal, eqn. (1), 
is correspondingly replaced by exp [js(f)]._ The error involved 
in this simplification is of order 0:35°% at most. 
In practice, the form of this low-pass admittance equivalent 
to the given band-pass admittance is not in essence different from 


2 
REEL Jae ek 
m(p) = 1/202(1 + a oe) (28) 
Since = eP/*%0 = 1 + paws + p*/2a?we +. (29) 
we shall write 7(p) in eqn. (28) in the form 
n(p) = €PIof1 + f(p)]/2x? (30) 
3 4 
Dp P 
= —plaw 
where $(p) (Gams aE Docent +... Je Pleo (31) 


so that if |p| is small compared with aw, |A(p)| is small com- 
pared with unity, |(p)| is small compared with 1/aw, and 
|6’(p)| is small compared with 1/a?w?, but |¢’’(p)| is of order 
1/o3.w§. 

If 7(p) were not given strictly by eqn. (28), eqn. (31) could 
eontain an additional small term in p?, and eqn. (33) would 
then contain a corresponding additional term in d?w/dt?. 

Now, corresponding to the first term (1/2«) exp (—p/ow) in 
eqn. (30), f(A) in eqn. (27) will have a term (1/2«?)U(t — 1/cws), 
where U(x) is the unit step-function, zero for x < 0, 4 for x = 0 

* In the case (usual in broadcast receiver design) of so-called transformer coupling, 
*4e circuit is not confluent, that is p/« 9 may occur without a corresponding term in 


s9/p. In such non-confluent terms, p/wo can be replaced by j and wo/p by —Jj with 
#n error of, at most, /@o or 0°7% for the f.m. conditions here considered. 


S11 


and 1 for x >0. It is equally well known that if the response 
of an admittance (p) to unit-impulse excitation is :/(f), the 
response of the admittance e~Y?d(p) is X(t — y). In order, 
therefore, to evaluate eqn. (27) when f(t) is the response of the 
admittance (30) to unit-impulse excitation, we have only to 
consider the response of the terms 

p [Gceae, p*|2Aatws .. . (32) 
to unit-impulse excitation. But the response of an admittance 
p’ to such excitation is d’U(#)/dt’, and therefore substitution in 
eqn. (27) and repeated integration by parts finally gives 


1 shee ko 
1 O= sal me — 1faw.) + éxito} dt w(t — Lag) 
i d* 


+ w(t — 1fowo) +.. j (33) 


24atag drt 
Egn. (33) and other formulae for output current like eqns. (10) 
and (18) have been derived by assuming only that if we know 
the admittance of a network we know the response of that net- 
work to unit-impulse excitation; otherwise all mathematical 
manipulations have taken place entirely within the time world. 
At present, symbolic calculus is commonly used to determine 
theoretically the response of a network to unit-impulse or any 
other excitation, and we have therefore by implication used it 
here, but the meaning and significance of the results is not 
associated with or dependent upon symbolic calculus. Indeed, 
it has become clear to anyone versed in operational calculus 
(in which p stands for d/dt) that all the results obtained above 
could have been more quickly and simply derived operationally. 
Such derivation is therefore carried out in Section 4. 


(4) OPERATIONAL DERIVATION OF RESULTS ALREADY 
OBTAINED 
For operational calculus, we assume that p is equivalent to 
d/dt and p" to d'/dt". Eqn. (2) is then replaced by 


I(t) = YQ) © se Se ee 


so that p’s and ?’s both appear on the right-hand side. Some 
doubt has recently been cast on the validity of operational 
calculus; the authors believe that such doubt is not justified, 
provided that sufficient care is taken concerning the limits of 
the integration corresponding to the operator p~!. This matter 
is discussed more fully in Reference 2, in which the use of opera- 
tional methods has also been validated in problems of the present 
kind in which stochastic processes are involved. Here we shall 
merely assume that any algebraic manipulation of Y(p) and of 
v(t) is permissible, provided only that there is no alteration of 
the order of terms as between p-terms and ¢-terms. Thus, 
starting from eqn. (34), when v(f) is given by eqn. (11) we use 
the fact that by Leibnitz’s theorem 


ple w(t)] = &(p + sky Ww) (35) 
I(t) = &* Y(p + jk)w() (36) 


which is equivalent to eqn. (23). If we expand Y(p) in a 
Maclaurin series (whose convergence is discussed later) and 
operate with the successive terms on v(t), we obtain eqn. (10) 
immediately, and if we expand Y(p + jk) in eqn. (36) in a Taylor 
series we likewise obtain eqn. (13) immediately. Applying 
eqn. (13) to the fm. case where k = wo + s(t)* and w(d) is 
exp j[s() — ts‘(Q], we obtain the quasistationary approxi- 


* The fact that k depends on t does not debar it from being regarded as constant, 
since it is independent of p. 


to obtain 
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mation [eqn. (16)] immediately. Again, eqn. (27), with due regard 
to eqn. (28), becomes 


Le — se ro “i P(p) w(t) (37) 


from which eqn. (33) is immediately deducible. 


(5) EFFECT OF THE DISCRIMINATOR 


Eqn. (33) appears to be the most useful form for I(t) in the 
f.m. case, when the signal is applied to one stage of the ise 
coupling circuits, and the effect of a number of such stages is to 
introduce additional delay and to modify the coefficients of the 
differentiated terms in eqn. (33) without greatly affecting their 
relative unimportance. Now the discriminator can be regarded 
as presenting an admittance 


¥(p) = (4 + =2)/Dw) . (38) 
Wo Dp 

where D(p) can be treated as if it were part of the amplifier, 

and therefore omitted from our present discussion. The 

behaviour of one particular type of discriminator is discussed 

fully elsewhere.* The low-pass equivalent of the effective part 


of eqn. (38) is then 


93(P) = 2p/wo . (39) 
so that the current, /,(t), allowing for the discriminator is 
Dein: 
I) = — Sf{e-Pf1 + Acm]} (40) 


Wo dt 


where f is the total delay introduced by the i.f. stages and the 
denominator D(p) of eqn. (38), and A(f) represents the additional 
terms introduced by applying eqn. (33) to these if. stages in 
succession. 


(6) CONVERGENCE 


Since the applied voltage is regarded as a stationary time series, 
we are not interested in transients associated with the fact that 
the applied voltage v(t) is really switched on at zero time and 
therefore preferably written v(t) U(t), where U(f) is the unit step- 
function. We should therefore regard p'K as zero if r is positive 
and K is a constant. The nature of the expression p’v(t), and 
therefore the convergence of a series in which p’v(f) is a typical 
term, depends upon the nature of v(t), which determines the 
effect of p, or d/dt, on it. The convergence or otherwise of the 
series is usually obvious once p’v(t) has been formulated as a 
time function. In symbolic calculus the criteria of convergence 
are less obvious. If we require the time function z(t) corre- 
sponding to a given p-expression {(p), which can be expressed 
as a series of negative powers of p having a,/p° as a typical term, 
it has been suggested that the series La,ts/s! is convergent for 
all values of ¢ with sum to infinity z(‘), provided that there is any 
range of positive real values of p for which the p-series Da,/p° 
converges with sum to infinity C(p). 

In the case of frequency modulation, we are concerned with 


the convergence of eqn. (33), which was derived from the 
expansion of 


EPleo 
1 + (p/ow) + (p?/20w) 


in eqn. (31) as a series in powers of p/aw , which we shall now 
call € for brevity. Now one way of obtaining this series expan- 
sion is to write 


$(p) = i? ReaD) 


1 1 
Wo = , = ee 
me) TER lL geiniA A gy snis 
(1 saga / ya + <paée j ) 
= a eitl4 eo Jtl4 
ine (42) 


ace 1 : 
Seasiite jl fe faeces 
1 + 738 | vs 


and expand the last member of eqn. (42) in ascending powers 
of €. The expansion is convergent for |&| <+/2 if € is an ordinary 
number; since the exponential series for exp € is convergent for 
all €, the series for f(p) obtained from the product #(£) exp é 
is convergent also for |€| <<+/2. But € is not.an ordinary 
number; €’ is proportional to d’/dt” in eqn. (33), so we have to 
consider next 


d'[eO Yar" (43) 


which can be obtained explicitly. 
function of ¢, and z,;, z)... z, are the first r derivatives of z 
with respect to f, 


Y= 55 = &[4i + @zj-*z2 + @zyz3 +... +z 


+ 3()z{7423 + 10(6)24-°z5z3 +...] 
' 


where (*) denotes the binomial coefficient Ma pee 
sir —s)! 

Terms involving squares and products of derivatives of z above 
the first only arise for r> 4. This series is finite: all combina- 
tions of 22z9z% . . . appear such that p + 2qg + 3k...=r. 

Now s(¢) contains frequencies up to 10kc/s but mainly in the 
region 100c/s—2 kc/s, whereas the maximum frequency deviation, 
[277, is 75kc/s. We can therefore reasonably take the maxi- 
mum numerical value of z, to be (0:05)—!y". If all the z, 
terms had their maximum numerical values simultaneously and 
in phase, the first row of terms in eqn. (44) would make a total 
contribution to |y,| of 


C, = p’[20 x 1-05" — 19 — r] 


(44) 


(45) 


There is a very large number of the remaining terms of eqn. (44) 
for large values of r, but they involve at worst powers of 0:05 at 
least as high as the second, and this would appear to prevent 
the total contribution of such terms from being comparable with 
C, in eqn. (45). For purposes of convergence, only the term 
20(1 O52)” in eqn. (45) matters. It means that the general term 
of the series [eqn. (33)] is numerically less than 20 times the 
general term of the series S obtained by substituting 1-O5u for 
p in eqn. (41). The corresponding value of € is about 0-35 if 
« is of order 30 = 3u/wWo, and 0:35 is much less than 4/2. 
Hence the series S, and therefore the series in eqn. (33), is 
rapidly convergent. 
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SUMMARY 


A theory of N-component operators is given for steady-state calcula- 
tions in linear circuits with inputs of periodic, but non-sinusoidal, form. 
If the input waveforms have a finite number, M, of harmonic com- 
ponents, NV may be chosen such that an exact theory results. The 
theory supplements an approximate one of the same nature recently 
published.4 The central feature of the method is the decomposition 
of any periodic function with M harmonic components into a number 
of samples of the same form, but of magnitudes equal to the ordi- 
nates of the function at equal intervals t throughout the period, and 
displaced in the time-axis by successive intervals +. By appropriate 
choice of the sampling function, an exact representation may be 
obtained. 

A shift operator u is now introduced which translates any periodic 
function one interval t to the right. By this means it is possible to 
express a periodic function with M harmonics as a product of a 
polynomial in uw and the basic sampling function. This polynomial in 
u, of order N — 1, gives an N-component operator representing the 
waveform. The representation is, in a sense, analogous to that used 
in vector algebra, where an arbitrary sinusoidal wave may be written 
as the product of a polynomial in j, such as a+ jb, with a basic 
sinusoidal reference wave. The theory gives rise to the fundamental 
‘results vwN = 1 and uN = — 1 for general, and odd harmonic only, 
waveforms respectively, and using these a standard form for an 
N-component operator may be obtained. 

Differential and integral operators are developed and rules are given 
for obtaining the N-component operator corresponding to an impe- 
dance, admittance or transfer function. The algebra of N-component 
operators is summarized and the solution of a simple circuit examined 
by way of illustration. The process of inverting an operator requires 
the solution of N simultaneous equations, and an alternative method 
of procedure is considered. Simple expressions result for power and 
r.m.s. values. 

The application to linear circuits is advantageous in the case when 
the input is known graphically or numerically and a similar description 
of the output is desired. It is therefore an alternative method of 
procedure to Fourier analysis, calculation of the responses to the 
harmonic components, and subsequent point-by-point plotting of the 
output. The theory may also be applied to the solution of non-linear 
circuits by continued approximation, but, as this technique is the same 
as for the approximate operators already dealt with, it is not considered 
in the paper. 


(1) INTRODUCTION 


Of recent years there has been much interest in the calculation 
of network and system responses by methods based on the 
expression of time functions by a series of numbers representing 
the values of the function at a number of equally spaced points. 
The procedures, which are known generally as ‘time-series’ 
methods and which are now well established, have been 
developed in a number of different forms, each suited to the 
particular problem under investigation. The essence of all the 
‘methods, however, is the use of an interpolating or sampling 
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function which is taken in varying amounts at equally spaced 
instants, and the time function is built up by summing these 
varying amounts. 

There are two main features which result, depending on the 
shape of the sampling function or unit chosen, and on the manner 
in which the time origin is introduced. These are the accuracy 
of representation and the emergence of ‘single-sided’ or ‘double- 
sided’ series. By a single-sided series is meant one having a 
definite starting point, usually the time origin, and developing 
to the right as time increases. A double-sided series, on the 
other hand, has a central ordinate, again usually at t = 0, and 
develops in both directions. Both types contain an infinite 
number of terms, which, for stable responses, will become smaller 
and smaller. The distinction between single- and double-sided 
series appears most prominently in the rule for the division of 
two series. In the former, polynomial division is valid, but in 
the latter it is invalid, at least in its normal form. In conse- 
quence, in the second case, an inversion process takes its place, 
and this is in general a more complicated process to carry out. 

The original work of Tustin! on time-series introduced as 
sampling function the ‘delta’ unit, i.e. an isosceles triangle of 
unit height and base twice the interval between pulses, and other 
pulse shapes were also given. The theory, which was concerned 
with functions and responses zero for negative time, gave rise to 
single-sided series and was approximate in principle. In another 
field of study, Shannon? showed that any band-limited time 
function could be exactly represented by its values at sampling 
points equally spaced 7sec apart, where 27 is the period of the 
highest frequency component present in the time function, 
provided that the sampling function chosen was the pulse of 
shape sin (zt/7)/(7t/7). Thomson? chose this shape of pulse for 
his theory of time series, which is therefore in principle an exact 
theory. Further, the particular problem of waveform trans- 
mission with which Thomson was concerned led to his adoption 
of double-sided series, and the resulting algebra differs from that 
of other methods in substituting inversion for division. All the 
methods of time series, however, have been devoted to problems 
of transient response, for both linear and non-linear systems. A 
periodic analogue of the technique was introduced recently by 
the author,’ but the general forms and rules which were found 
to apply indicate that a more correct description of this develop- 
ment would be an N-component algebra of generalized steady- 
state operators. In Reference 4, an essentially geometric 
approach was employed, and the interpolating function, instead 
of being a non-periodic delta unit, was a periodic delta-unit 
waveform. The theory was in consequence approximate, and, 
further, division of operators was accomplished by inversion of 
the divisor. The extension that is presented in this paper is that 
the theory is now made exact. This is brought about by the 
use of the periodic sampling function sin (zt/7)/sin (7t/N7), 
where Nr is equal to the half-period or complete period, depend- 
ing on whether the periodic function being represented has odd 
only, or odd and even harmonics in the waveform. In each case 
N is the number of ordinates at intervals 7 throughout the half 
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or complete period respectively, and 2N7/(N — 1) is the period 
of the highest frequency component. 

Apart from different rules for forming the differential and 
integral N-component operators, the algebra is the same as for 
the approximate theory. Use is made of a shift operator u 
which translates the waveform one interval 7 to the right, and 
impedance, admittance and transfer-function operators are 


N-1 
obtained in the general form A= >) A,u* as before The 
k=0 


fundamental relations v¥ = — 1, and uN = 1 are obtained. As 
the application of the exact theory differs only slightly from the 
application of the approximate theory, only a few remarks con- 
cerning this are included, and further information may be 
obtained from Reference 4. 


(2) N-COMPONENT OPERATORS 


The theory about to be presented is concerned with periodic 
functions having a finite number of harmonics. If this number 
of harmonics, including the fundamental, is M, it will be shown 
that it is possible to express the wave exactly by 2M independent 
numbers in conjunction with an appropriate sampling function; 
further, that these 2M independent numbers are the ordinates 
of the wave at equal intervals 7 throughout the period, or, in 
the case of waves having only odd harmonics, throughout the 
half-period. It is reasonable to expect this result as, from the 
viewpoint of the harmonics, 2M independent numbers are 
required to state their amplitudes and phases. 


(2.1) Basis 


The sampling functions chosen are S(t) = sin Nwt/(N sin wf), 
where N is even for those waveforms having only odd harmonics, 
and R(t) = sin+Nwt/(N sin 4wt), where N is odd, for periodic 
functions in general. The trigonometrical expansions of these 


are 
2 
SO w lees wt + cos3wt +... +cos(N — 1)wt] (1) 
and 
1 2 
IAG) = W | wicos wt + cos2wt +... + cos$(N— 1)wt] . 2) 


There are M = $N components in the first of these expressions 
and M = 4(N — 1) in the second. The functions are shown in 
Figs. 1(a) and 1(5), in which, for comparison, are given also the 
periodic delta unit waveforms corresponding to them. The 
zeros of S(f) occur at t = km/(Nw), k =1,2,...,2N— 1, with 
the exception of k = N, when S(t) = — 1. The zeros of R(f) 
occur at t = k2n/(Nw), k = 1,2,...,N—1. Att=0, S(t) 
and R(t) both equal unity. It follows therefore, that if we are 
proposing to sample any waveform /(f), having only M harmonics 
including the fundamental, using the function R(t), a choice of 
the sampling instants equal to t = kr = k2m/(Nw) will result 
in a contribution from only one sample at any particular sampling 
instant, since at that instant zeros of all the other samples occur. 
Analytically this may be shown as follows. 

Let f(t) be written as the sum of a number of samples, 
using a) delayed by successive amounts 7. We have then 


N= ~, a,R(t-— kr). By setting t = kr,k =0,1,...,N— it 


in turn we have a, = f(kr). 
Skt) = fy, we have therefore 


Writing the ordinate values 


ESN 
SO = E feRlt — kr) 3) 


(a) 


Fig. 1.—The sampling functions S(t) and R(f). 


(a) S(t) = (sin Not)/(N sin wt), for N = 6. 
(6) R(t) = (sin 4Not)/(N sin $1), for N = 11. 


The wave is represented exactly by its ordinates at the N instants 
t=0,7, 27,...,(N—I1)t. If f@) has no d.c. component, 
however, these NV numbers are not independent. Remembering 
that R(t) has average value 1/N, the additional relation obtained 
from eqn. (3) is 


N-1 
x AN =90 (4) 
k=0 


so that in all there are N — 1 independent numbers representing 


the wave. This is the same as 2M, i.e. twice the number of 
components present. The case of only odd harmonics is similar. 
We have 


N-1 

fO= >, SiS — kr) . (5) 
where tT = z7/(Nw). 
in this case. 

The above results are equivalent to the statement that a wave 
containing harmonics up to the mth requires, for its exact 
description, N = 2m + 1 equally spaced ordinates throughout 
the period. If there is no d.c. component, 2m of these numbers 
are independent; if there is a d.c. component the 2m-+ 1 
numbers are all independent. Further, if only odd harmonics 
up to the mth are present, the number of equally spaced ordinates 
throughout the half-cycle required for the exact description of 
the wave is N= m-+ 1. 

For example, the wave v(t) = cos wt + 0:S5cos3wt +0:5cos Swt 
has three odd harmonic components, the highest of order 5. It 
can therefore be represented exactly by eqn. (5) with N = 6. We 
have + = m/(6w) and V;, = v(k7), kK =0, 1, 2,..., 5, ie. V;, 
is the set of six numbers 2-000, 0-433, 0-250, 0, —0-250, —0-433., 
The following Section shows how this set of N numbers can be 
formed into an algebraic N-component operator. 


There are N = 2M independent numbers 


(2.2) N-Component or U-Operators 


A theory of N-component operators may be developed on the 
above basis by introducing a fundamental operator u, whose 
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effect is to translate any time function to the right by one 
interval 7, that is, uf(t) = f(t — 7). In general the relation is 
ukf(t) = f(t — kr), where k is any integer, positive, negative or 
zero. It follows therefore that u~*ukf(t) = (1), i.e. u-* is the 
inverse of u*. For periodic waves in general, with the complete 
cycle divided into N parts, we have 


uN = |} (6a) 
For odd harmonics only, Nr = z/w, and 
uN = —1 (65) 
Using this operator eqns. (3) and (5) become 
N-1 
{O= 24, eg hee ee rte (7) 
N-1 
fO = >> UES ocean rma 6:9) 
These may be written in the simple form 
f= 2 fur’ es eee et at. (9) 


where k is summed* over the integral values from zero to N — 1. 
Eqn. (9) may be termed the operator form of the waveform f(t), 
in the same way that 


a Voj° aia V7 = Vo + IV, 


is the operator form of the sinusoidal wave corresponding to V. 

Further, any expression such as a + bu* + cu® + ..., con- 
taining a finite number of terms, may be reduced to the form 
>) 4,u*, where some of the A; may be zero, by the substitution 
k 


uN = 1 or uN = — 1 as the case may be. This is analogous to 


reducing a + jb + j*c + jd, for example, to (a — c) + j(b — d), 
by substituting 7? = —1. The forms }) 4,u‘, fu’ will be 
k 7 


termed N-component operators, or u-operators for short, and 
denoted simply by A, f respectively. 


(2.2.1) Differential and Integral Operators. 


In this Section an operator p is developed such that p operating 
on a waveform /(f), written pf, results in an operator representing 
the waveform f(t), where f(t) is the derivative with respect to 
time of f(t). This may be found as follows. We have 


IO= 2 A,.RG — kt) = 2 fuk R(t) 
and 


I= pa f, R(t — kt) = p2 Sut RXt) 


Since R(t), the derivative of R(t), contains the same number of 
harmonics as R(t), it may be written according to eqn. (7) 


RS aR ERG) « (10) 
k 
where R, is the ordinate of R(t) at t = kr. 
By substitution in the expression for f’(‘), therefore, 
f= DY AukRO = Fu Vs Ryw)R(t) . (11) 
k j 


or, in operator form f’ = fp. The differential operator is 


‘herefore 


p => Rul = > Ryu (12) 
1 k 
* Unless stated otherwise §) will in future denote summation with respect to k 


k F ; 
sver the N values from zero to N — 1. Similarly for any other index letter written 
n this fashion. 
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Similarly for waves having only odd harmonics, the differential 
operator is >) S,u*, where S; is the ordinate of S(t) at t = kr. 


By applying these results, the following operators are obtained: 


General waveforms 


N-1 
p=i0 [(— 1) cosec (kar/N)]u* (Nodd) . (13) 


Odd harmonics only 
N-1 

p=w > [(—D)* cosec(ka/N)]u* (Neven) . (14) 
k=1 

An alternative series form for each of the coefficients Ry and S; 


may be written, namely 
ey (N=N/2 


R= W > msin(2mkr]N) . (15) 
m=1 
Dip aa : 
S.= W x msin(mkr]N) . (16) 


m=1,3..:. 


The summation in this last result takes place over only odd 
values of m. Note that there is no coefficient of u® in the 
operators, eqns. (13) and (14). The differential and integral 
operators for several values of N are given in Section 6. 

The integral operators are calculated in an analogous fashion. 
Integrating 


1 2 
R(t) = ah + = [cos wt + cos 2wt 


+ cos 3wt + + cos 4(N — l)wt] 
2 W=DI2 sin mwt 


[R@ a poe 


gives 


m=1 m 
Then we have 


oO — Di fcRE ht) 2 Sit RO) 


Let us write the periodic part of this as W(?). 


and 


| $0 = Eh | R(t — kr)dt = 2 f,.Wt — kr) 


since >) f,t/N is zero for waveforms with zero average value. 
k 


Hence J fOdt = % fu WO = fl Wu!) R() 


N-1 
The integral operator is therefore p-! = 3) W,u*, where 
k=1 
2 W=DI2 sin 2mka|/N 


= N odd) . 
No it (N odd) 


W,, (17) 


For waveforms with odd harmonics only, writing | S(Hjdt = V(t), 


N-1 
where V(t)=(2/Nw) > (sin mwt)/m, the operator is 
m=1,3... 


N-1 
pte SV ak 


k=l 
where 2 sin mka]|N 
V, = — — 


ING = 1e3ee. m 


INSI 


(N even) . (18) 


The summation in the above expression takes place over odd 
values of m only. 
(2.3) Operation on a Waveform 


From the manner in which the differential and integral 
operators have been developed, it will be seen that an operator 
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A operates on a waveform f according to the simple tule of 
polynomial multiplication. If g(‘) is the waveform resulting 
from the operation of A on f(t), we have 


f(t) =X fk RO) 
k 

g(t) = Dg RO 
k 


where ¥) g,u* =(3 fw) Aju’), the product on the right-hand 
k ; / 


J . . 
side being subsequently reduced by the substitution leew 


have 
Af=fA 


=D fuiS Aue! 
ij 1 
= DY KAuit! 
Hal 
= D FA, ju 
k,j 


That is, the Ath component of g is g, = DG An j= GAN +k ae 


J Aj 
For example, go = fo4o + fiAn-1 + frAn-2 +--+ - +fy—1A1- 
The substitution A,_; = An,,-; is made for all k —j <0. 
For the case of waveforms having only odd harmonics 
Ay_; = — Ay +x; since uN = — 1. 

For example, the current waveform i(t) = sin wt + 0:5 sin 2wt 
+ 0-5 cos 3wt has both even and odd harmonics and therefore 
requires a choice of N odd and equal to 7. In operator form 
the current is 


r=) Jui = 05000 + 0-8144u + 1-069 7u? 

; — 0-068 3u — 0-154 1u4 — 0-446 3u5 — 1-7244uS 
Let this current be passed through an impedance given by 
Z => Zu! = 2-:0000 — 1:1523u + 0-639 5u? 

— 0:5129u3 + 0-5129u* — 0-639 5u° + 1-1523u° 


This operator in fact corresponds to the impedance 2 + jl] 
ohms, and has been obtained in the manner indicated in the 
immediately following Section. 

The current J operating on Z gives the voltage drop: 


Za ae 
k 


where the components of V are given by 


Ve yD Zp tor hem 0-102, 2.6 
y 


Further, since u’ = 1 in this case, we set Z,_; = Z74,_,; for all 
k —j<0. That is, 


Vo = 1hZo + Ze + Zs + 13Z4 + 14Z3 + 152, + [gZ, 
V, == IpZ, + Zo a iat LZ, + I;Z; an lca Z4 + I;Z; + [gZ> 


and so on. Numerical evaluation of the above components is 
most easily carried out using a desk calculating machine and the 
‘movable strip’ technique. The resultant or product operator is 


V = 3-000 + 1:376u + 2-196u2 — 1-881u3 + 0-882n4 
— 3:196u5 — 2-3941,5 


(3) APPLICATION 
(3.1) Impedance, Admittance and Transfer-Function Operators 


The differential and integral operators of the preceding Section 
are used to form N-component operators corresponding to any 
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impedance, admittance or transfer function. If we have an 
impedance Z(p), then the N-component operator corresponding 
to this is formed by replacing p by the expression given for p. 
Similarly with admittances and transfer functions. For example, 
the series circuit having impedance R + pL yields the operator 


Z=R+ pl =D Z,u* 
k 


and so on. For N = 6, say (odd harmonics only), the steady- 
state impedance R + jw will give 


R + w(—2:000u + 1-155u2 — 1-000u3 + 1-155u4 — 2-000u°)L 


i.e. an expression of the form }) Z,u*. 
k 


(3.2) Algebra of N-Component Operators 


The algebra of N-component operators has already been given 
in Reference 4. It will suffice therefore to summarize the results. 


(3.2.1) Unit Operator: Addition and Subtraction: Multiplication. 


The unit operator is one in which the coefficients of all powers 
of uw vanish with the exception of the coefficient of uv°, which 
is unity. 

The sum or difference of two operators is formed by taking 
the sum or difference of corresponding components, i.e. 


V4 Sy Be Saez = Boa 
k k k 


The product of two operators is formed according to the rule 
previously given for the operation by an operator on a waveform, 
ic. AB = C, where C, = >) A;B,_;. Multiplication of N-com- 


ih 
ponent operators is commutative, i.e. AB = BA. 


(3.2.2) Inversion: Division. 


Division of operators is accomplished by a process of inversion. 
Suppose A is the operator to be inverted and that its inverse is 
A-!=B. Then AB = 1, the unit operator. This gives the set 
of equations >) A;B,_; = Cy, k =0, 1, 2... (N— 1), where 


J 
Co = 1, and C, =O fork #0. By solving this set of equations 
the required coefficients of B may be found. A formal statement 
for the inverse has been given,* but in practice it is better to 
solve the equations by a short method, such as that due to 
Crout.> None the less there is considerable labour attached to 
inverting. 

An alternative concept of the inverse of an operator for any 
circuit is that it is simply the response of that circuit to a unit 
waveform input, that is, the response to R(f) or S(t), and this 
may be calculated from the known harmonic components of 
these functions. Viewed in this fashion, the response of a 
circuit to any input is therefore a selection of unit waveform 
responses, taken in proportion to the input ordinates at N points, 
and given the correct delayskr, k = 0, 1, 2,...,(N— 1). The 
calculation of the inverse in this fashion requires less labour for 
simple circuits, but for more complex circuits it would be more 
lengthy than the method of solving the equations. 


(3.3) Power and R.M.S. Values 
It is shown below that the power represented by two waves V 
and J is simply (V:-D/N = ( VEIN, being the scalar product 
of the two N-component operators. Likewise the mean square 
value of a voltage V is x V2/N, and the r.m.s. value the square 


root of this. These are analogous results to those giving the 
power in two complex waves of voltage and current in terms of 
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the harmonic powers, and the mean square value in terms of the 
harmonic mean square values. 


If we have V(t) = 2 V,R(t — kr) and I(t) = 2 [R(t — kz), 
the average power is 


2rlo 


(cof2n) | Vol (Hadt 


2r]w 


= (w[2m) JS VileRe — jryRE — ke 
isk 
2rt]@ 
= (wf2m) | VedeRMt — kaa 
= k 


2r/o 


rs (co[2n) | & Vile — FRO kya 


(N—1)/2 
For the function R(t) = 1/N + Q/N) 2s cos mwt, the second 


of these integrals is zero, and the first ee i gives x VAIN. 


A similar result holds for waves having only odd Sree By 
putting J(t) = V(t), the mean square value of a waveform V is 


DEIN. 
k 
(3.4) Circuit Example 


Consider the circuit shown in Fig. 2, having the fundamental 
branch impedance Z, = 1 + j1, Z, = 1 — j2, and Z; = 1 + /3. 


Z3 
WWW OT $0 


Fig. 2.—Illustrative series-parallel circuit. 


Suppose the voltage applied to this circuit is v(t) = cos wt 
+0:5cos3wt + 0:5cosSwt. This is simply an arbitrarily 
selected complex wave having three odd-harmonic components. 
In operator form it is 


V = 2-000 + 0:433u + 0:250u? + 0-000u3 — 0-250u4 — 0-433u5 


where N has been taken equal to 6 in order to give an exact 
representation, and V, is equal to v(k7). 

The operator forms for the impedances are obtained by 
replacing jw and —j/w by the differential and integral operators 
of Section 6, taken for N equal to 6. Thus, using three decimal 
places only, 


Zi = 1 £:1(— 2-000u + 1-155u2 — 1-000u3 + 1-155u*—2-000u5) 
Z,=1+2(0-311u +0-231u? +0-289u3 + 0-231u4 + 0-311u°) 
Z, = 1 + 3(—2:000u + 1-155u* —1 -000u3 + 1-155u+ —2-000u5) 


The admittance is (Z,; + Z)/(Z,Z, + 2,23 + AEN We 

have ae — 2-999 — 1-378u + 1-617u2 — 0-422u3 + 1-617u4 

— 1-378u>, using the multiplication rule given, and similarly for 

2,2; and Z7,. The sum Z,L> + 2,2; + Z3Z, 3 is — 24: 009 

+ 6-034u-+ 6:-163u2 — 6:844u3 + 14-165u+ — 35-546u°. Let 

‘his be denoted by A =) A;,u*, say, and its inverse by 
k 


8 cee 


Then from the relation AB = 1, we have the 
B, = Cy, where C, = 1 fork = 0, and C,=0 


(N — 1). From these equations the coeffi- 
In fact the equations are 


Sriations ~ Aye. 


at 2 ..., 
‘ients B, may be found. 
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— 24-009By + 35-546B, — 14-165B, + 6-844B, 
— 6-163B, — 6-034B, = 1-000 (k = 0) 


6°034By — 24-009B, + 35:546B, — 14-165B, 

+ 6°844B,°-— 6 1638. 0" (kK ="1) 
The result is 
= 10-'0-2264 + 0:5146u + 0-341 3u? 

+ 0-091 6u3 — 0-0929u* — 0-128 6u°) 


and so on. 


Asa 


The admittance is 


Y = 0:0622 + 0-1208u + 0-056 2u? + 0:529u3 


+ 0-021 2u4 + 0:0395u5 
and the current is 


I= VY = 0-168 + 0:296u + 0-208u2 + 0:179u3 
+ 0:081u* + 0:04415 


The above ordinates give the required waveform and the r.m.s. 
value is [2X J?/N]'/? = 0-182. 
k 


From this simple example it will be seen that practically all 
the labour of calculation lies in obtaining the admittance, for 
which it is necessary to invert an N-component operator. In 
view of the nature of the equations giving the unknown com- 
ponents of the inverse operator, i.e. the cyclic type of coefficient 
array obtained, it is possible that a simpler method of inverting 
can be found, and if so, this would be highly advantageous. 

The alternative method of calculating the admittance, i.e. by 
calculating the unit waveform response, has some merit in this 
simple case. The unit waveform is, for N = 6, S(t) = 4 (cos wt 
+ cos 3wt + cos5wft). The response to this, evaluated at 
points t= kr, k =0, 1, 2,....(N—1), is 0-068 + 0-119u 
+ 0:050u2 + 0-055u3 + 0:027u* + 0-037u5, which is the admit- 
tance Y. The discrepancy in the figures is due entirely to taking 
only three decimal places for the operators p and p~! in the first 
calculation, and is not due to any error in principle. 


(3.5) Conditions of Application 


Application of the foregoing theory to linear circuit work 
would be advantageous in the circumstance of the input being 
known numerically or graphically, and a similar description of 
the response being desired. In this case there is no need for 
Fourier analysis of the input, or summation and point-by-point 
plotting of the output. If the Fourier components of the input 
were known and an output waveform was required, there would 
be little to choose between proceeding by the above method and 
by the usual method of calculating the harmonic components. 
The calculation of inverses requires the solution of N simul- 
tanous equations, and it would therefore be fair to say that this 
amount of labour should be compared with the amount necessary 
in a Fourier analysis. There would seem to be the probability, 
however, when large numbers of calculations require the use of 
computers, that the method suggested might be more suited 
than the harmonic components method. 

Although having a limited application in linear circuits, its 
application to non-linear circuits by a method of continued 
approximation seems most advantageous. There are other 
methods, of course, available in this case using successive correc- 
tions to Fourier components, but, generally speaking, these 
only hold for sinusoidal applied forces. This application has 
already been dealt with in Reference 4. 


(4) CONCLUSION 


An exact theory of N-component operators for linear circuits 
has been given, and its conditions of application discussed. The 
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theory is intended to supplement an approximate one of the 
same nature. The relationship of the theory to time-series 
methods has been pointed out. Asa practical method of calcula- 
tion the theory has a limited application in linear circuits, and a 
more advantageous one in the approximate solution of non- 
linear circuits. As a theoretical development it provides an 
alternative decomposition of periodic functions to Fourier series, 
and, while this aspect has not yet been properly examined, it is 
suggested that the theory may be of assistance in investigating 
the stability of non-linear circuits. It is a further interesting 
point to note that, for N = 2, the sampling function S(t) reduces 
to cos wt, the shift operator u becomes j~! and normal vector 
algebra results. That is to say, vector algebra, suitable for waves 
having only a fundamental component, is a particular case of 
the foregoing theory. 
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(6) APPENDICES 
(6.1) Differential Operators 


The following columns give differential operators for values 
of N from 4 to 9. The operators for which N is even apply 
to waveforms having only odd harmonics and are calculated 
from eqn. (14), omitting the factor w. Those operators for 
which N is odd apply in general and are calculated from egn. (13), 
omitting the factor w/2. The factors w and w/2 are to be subse- 
quently inserted depending on the input fundamental frequency. 


k N=4 N=5 N=6 N=7 N=8 N=9 

1 —1-41421 —1-70130 —2-00000 —2-30457 —2-61313 —2:923 80 
2 1:00000 1-05146 1-15470 1-:27896 1-41421 1:55572 
3 —1-41421 —1-05146 —1-00000 —1-02569 —1-08239 —1-15470 
4 1:70130 1-15470 1-02569 1-00000 1:01543 
3) —2-00000 —1-27896 —1-08239 —1-015 43 
6 2:30457 1:41421 1-15470 
7 — 26153 = 555m 
8 2°923 80 


(6.2) Integral Operators 


The following columns give integral operators for values of N 
from 4 to 9. The operators for N even and WN odd are restricted 
as in the previous Section. For N even they are calculated from 
eqn. (18), omitting the factor 1/w; for N odd, from eqn. (17), 
omitting the factor 2/w. 


N=4 


0-471 405 
0-333 333 
0 


N=5 N=6 

1 0-248 990 0-311 111 

2 0-022. 451 0-230940 0-071043 0-175093 
3 0°471405 —0-022451 0-288889 0-052563 0-212941 0-072169 
4 —0-248 990 0:230940 —0-052 563 0:180953 0-007011 
2 0-°311111 —0-071043 0-212941 —0-007011 
7 

8 


N=7 N=8 


0-201 996 0-232 522 


N=9 


0-167 708 


—0-201 996 0-175093 —0-072 169 
0-232522 —0-078 494 
—0-167 708 


0-078 494 — 
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SUMMARY 


An oscillator may be synchronized by an injected signal if the fre- 
quency of the signal is close enough to the natural frequency of the 
oscillator, and if its amplitude is sufficiently large. The output of the 
oscillator then contains only the frequency of the injected synchronizing 
signal. In practice an interfering signal is usually present in the input 

to the oscillator together with the synchronizing signal, 
important property of the synchronized oscillator is its discrimination 
‘against the interfering signal even when the difference between the 
interfering and synchronizing frequencies is infinitesimal. The 
‘response of the oscillator to the interfering signal is investigated under 
‘conditions where the interference component of the oscillator output 
is small; the response is found to be linear, and the discrimination is 
‘expressed analytically. Good agreement between theory and experi- 
‘ment is obtained. 


LIST OF SYMBOLS 


v = Oscillator tuned-circuit voltage. 

«, y = Constants of the differential equation of the oscillator. 
H(t) = Forcing function of the oscillator. 

a = Amplitude of free oscillation. 

@, = Natural (angular) frequency of the oscillator. 

q@, = Synchronizing frequency. 

w, = Interfering frequency. 

A; _ 2(w9 — @) 


O71 = 

a ax 
a. eae 

O (e4 

Ain — 2[@ — 2w, — w2)] _ 
a ee : ue 


01 crit = Walue of o, at which synchronization ceases. 
A= Amplitude of the synchronizing sinusoidal forcing 


function. 

B= Amplitude of the interfering sinusoidal forcing 
function. 

F, = == Awo/agx 

F, = — Bodo[agx 


b;, b> = Sine and cosine components of the output of w,. 
b3, b, = Sine and cosine components of the output of wy. 
bs, bg = Sine and cosine components of the output of 2w, — w. 
pi = (6} + bD/a3 
p2 = (b3 + bal 4 
23. (3 ant B14, 
Dea C= (b} a5 b3)|a 
Pm = d = 2b 1Do/a6 
D = Discrimination of the oscillator against the inter- 
ference of frequency wy. 
V = Diode forward voltage. 
I = Diode forward current. 
a, I, = Diode constants. 


g=1-— 2p, 
Correspondence on Monographs is invited for consideration with a view to 
p» blication. 


The authors are in the Department of Electrical Engineering, Imperial College of 
& ience and Technology, University of London. 


and an. 


(1) INTRODUCTION 


A synchronized oscillator—that is, an oscillator whose fre- 
quency is identical with that of an injected synchronizing 
sinusoid—has a number of practical applications.!»?»3»4 Among 
these are carrier generation in communication systems, the 
reduction of the depth of modulation of an a.m. signal, and the 
improvement of the signal/noise ratio of a noisy signal. 

The synchronized oscillator is especially suited to these applica- 
tions because it has the property of discriminating against a 
small-amplitude interfering signal* which may accompany the 
synchronizing signal. Some discrimination may result from 
the selectivity of the oscillator’s tuned circuit, but this cannot 
explain the fact that there may be discrimination against the 
smaller-amplitude interfering signal even when its frequency is 
infinitesimally different from that of the synchronizing signal. 
In this latter case, the discrimination is due solely to the non- 
linearity of the oscillator. This effect cannot, therefore, be 
explained by a linear analysis of the circuit, and non-linear 
circuit analysis must be used. 

This problem has been studied extensively by Tucker,!:? 3 
and by Tucker and Jamieson,* and many interesting results have 
been obtained. Expressions for the response to the interfering 
signal in a synchronized oscillator were obtained for the cases 
where the interference frequency is (a) far removed from the 
oscillator frequency, and (4) near, but not in, the synchronizing 
range of the oscillator. Tucker? suggests that a general solution 
of the case where the frequency of the interfering signal lies in 
the synchronizing range may not be practicable, and he obtains 
a restricted solution based on the assumption that the selectivity 
of the oscillator may be neglected. This assumption implies, in 
effect, that the synchronizing and natural oscillation frequen- 
cies differ negligibly, or that the oscillator has no frequency- 
determining network. 

In the present paper no such limitations are assumed, and the 
solutions obtained are quite general. The solutions of the non- 
linear differential equation of the oscillator are obtained by the 
method of van der Pol. The ‘phase-plane’ approach of 
Andronov and Witt® is introduced, leading to the same solutions, 
and their method of analysis is used to verify the ‘mathematical 
stability’ of these solutions. 

In Section 2 the oscillator under consideration is introduced, 
and its non-linear differential equation discussed; this equation is 
solved for the case of a single injected synchronizing signal. A full 
treatment of the methods of analysis used is given by Stoker.’ 

Section 3 discusses the behaviour of the oscillator when the 
injected forcing function consists of a synchronizing sinusoid 
and an interfering sinusoid of a similar order of frequency. 
The response of the oscillator is derived for the condition where 
the interference component of the oscillator output is small; 
this leads to a completely general expression for the discrimina- 
tion against the interfering signal. 

* In describing the interfering signal as small, reference is being made to conditions 
at the output of the oscillator. The reason for this unusual definition will be apparent 


in Section 3. : ' : 
+ For an explanation of this expression see Section 2.2, 
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It should be mentioned that, although the injected interfering 
signal suffers discrimination, the oscillator itself introduces 
interference in the form of an intermodulation signal, by reason 
of its non-linear action. The level of this latter interference in 
the output of the oscillator is expressed analytically: it is generally 
lower than that of the interfering signal. 

A transistor oscillator was used in the verification of the 
theory, and excellent agreement was obtained. 


(2) TUNED OSCILLATOR WITH ONE INJECTED SINUSOID 
(2.1) The Oscillator 
The oscillator considered is a tuned oscillator whose forced 
behaviour is described by the non-linear differential equation 
d*v 
dt? 
where v is the oscillator tuned-circuit voltage, wo is the natural 
frequency of the oscillator, and H(f) is the forcing function. 


A physical representation of this equation is shown in Fig. 1(a). 
The parallel tuned circuit is made regenerative by the negative 


aP i ev +yv3)+ee= HH) . . 


CURRENT 
FORCING 
FUNCTION 


(a) 


Re f (i) 


VOLTAGE (NONLINEAR) 


FORCING 
FUNCTION 


ul 
: Sal 


(b) 
Fig. 1.—Simple equivalent forms of the oscillators considered. 
@ HO =F = cs — 6% 4 Ly @ +2 
where f(v)v oc v3 in the case considered. 
® HO =F =L RE + Sons b 


where f(i)ic i3 in the case considered, 


conductance, with limiting effected by the non-linear positive 
conductance. Fig. 1(6) shows a dual circuit representation 
described by the same form of equation but with current as the 
independent variable. This paper is concerned with the analysis 
of oscillatory systems of the general type defined in Fig. 1, the 
voltage differential equation (1) being considered specifically. 
The tuned-grid oscillator is an example of a circuit described by 
this equation (see Reference 7, p. 148). 

It may be argued that the ideal non-linearity considered is not 
realized in practice. It is, however, usual in the analysis of such 
systems to represent the non-linearity by a power series, and to 
truncate it at the cubic, so that the first time derivative of only 
v, v* and v? remains in the differential equation. If such a 
simplification is not made, the solution of the oscillator becomes 
an extremely difficult task. It is considered that such a simplifi- 
cation is a reasonable compromise between utility and com- 
plexity. It has also been assumed that the non-linearity is 
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skew-symmetrical about the quiescent operating point, and hence ; 
the non-linear quadratic term d(v’)/dt has been omitted, leaving ; 
d(yv*)/dt as the only non-linear term. 


(2.2) The Method of van der Pol for Analysis of the 
Oscillator with One Injected Sinusoid 


If the free behaviour of the oscillator is described by the non- 
linear differential equation | 
dy ad 
—, + —(—av + yo?) + wiv = 0 
Ho at’ eae) Y 
where «/w is small, it may be referred to as a nearly sinusoidal 
oscillator, and the harmonic content of its output is small: many 
tuned oscillators can be so described. 
Suppose a sinusoid of frequency w, to be injected, such that — 
the forced behaviour of the oscillator is described by the equation — 
dv | 


d ; 
7 + Progie. + yv3) + OU Awé sin @yt «<5 G 


(2) 


then, if the amplitude A is sufficiently large, and w, is sufficiently — 
close to wo, the output will be a single frequency w,, again 
with little harmonic content. Consequently, it might appear — 
reasonable to attempt to solve eqn. (3) by substituting a solution 


v(t) => by sin wt + by cos wt 7 : 5 < (4) 


This would lead to solutions for b; and by in terms of the 
variables A, w, and the oscillator constants. It may be, how- 
ever, that such a solution is unstable in the sense that, if the © 
oscillation is made to deviate slightly from this solution, for 
example owing to noise or changing oscillator parameters, it — 
never returns, but tends to another, stable, state of oscillation.* 

In order to be able to establish the stability of the solutions, — 
van der Pol assumed that b,; and 5, in eqn. (4) were not constant, — 
but slowly varying functions of time, i.e. 


v(t) = b(t) sin w,;t + b(t)cosw;t . . . (5) 


b,(¢) and b,(¢) can represent the process of divergence from or 
convergence to the solution given by the constants in eqn. (4). 
Because 5,(t) and b,(f) are slowly varying functions of time the 
following inequalities exist: 

by by be 2 by 
Wo Wo Wo Wo 


by by 

a oe  o <bn 6) 
where the dot notation is used to represent differentiation with 
respect to time. 

The method of solution is first to assume that a/wy and — 
(wo — w;)/wo are small, and to specify that the maximum values 
of b; and b, are of the same order as do, the free oscillation 
amplitude, so that it follows from eqn. (6) that 6,/agwo and 


balay are small and that b,/agw% and by/ayw3 are very small. 
Then the solution (5) is substituted in eqn. (3). All terms in 
which the coefficient is very small, or the product of two small 
terms, are neglected, as also are terms of frequency 3w,. All 
the remaining terms contain sin w,f or cos wt, and the next 
step in the analysis is to equate the coefficients of sin w,t and 
cos w,f on each side of the equality sign. This results in the 
following first-order linear differential equations: 


& b? + b2 
2b, ta boA, —s aby (1 — i) = @) Xe 4 (7) 


* A useful analogy is a solid pendulum placed with its centre of gravity directly 
above its axis of rotation; its state is unstable since any slight disturbance displaces it, 
causing it to seek a position of stable equilibrium, with its centre of gravity directly 
below the axis of rotation. 


, b? + b2 we 
2by — by ~ abp(1 een eee 4 
2 — 5,2, — ab, RB ) am ~ — Aw (8) 
with Ay = 2(@ = w,), Ga 4Aafsy «.. . (9) 


where dy is the amplitude of free (unforced) oscillation, and 
where use has been made of the approximation 


Ay & (w — w/a, 


By assuming that b,; and b, are constant (i.e. that the output 
contains only one frequency, w;, and hence that the oscillator 
is synchronized) 5, and 5, may be put equal to zero. Eqns. (7) 
and (8) are thereby reduced to simple algebraic form and may 
be solved to give b; and b, and hence the output amplitude* of 
frequency w). 

The stability of the solutions is determined by replacing b, 
and b, in eqns. (7) and (8) by b, + 5b, and 6, + 545, and 
obtaining linear differential equations for 65, and 555. The 
form of the roots of these equations indicates the stability, i.e. 
whether, in effect, a small disturbance 6), of b, (say) tends to 
increase with time (instability) or tends to zero (stability). 

The results obtained by van der Pol’s method may con- 
veniently be illustrated in a response diagram, as shown in Fig. 2. 
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BUTE 
oF+(1-p,)(1-3 p,)= 0 


Fig. 2.—Response curves of the forced oscillator described by eqn. (3). 


Stable solution. 
—--—-— Unstable solution. 


The region of instability is cross-hatched. 


The co-ordinates and the running parameter have been nor- 
mialized for convenience, according to the following relations: 


a cee a ey Fe een 10 
(27; ee P1 2 > 1 ao ( ) 


* The restriction that the maximum values of b, and bp be of the same order as ao 
is removed if only the steady-state solution for the output amplitude is considered. 
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The abscissa is a measure of the detuning of w, from wo, and 
the ordinate is proportional to the square of the normalized 
output amplitude. Sample response curves for various values 
of the injection amplitude 4, and hence of F;, have been plotted, 
and the region of instability is indicated by cross-hatching. 


The results are described mathematically as follows. The 
(implicit) expression for the response curves is 
poz + pill — py)? = FP (11) 


and the region of instability includes the area p; < 0-5 and the 
area within the ellipse 


of + (1 — pl — 3p) =0 (12) 


A useful, and explicit, condition for synchronization is found 
from the stability criteria and eqn. (11) to be 


2F? > 0? + 0-25 
or > 2o%, whichever is the smaller . 


(13) 


The condition 2F{ > 207 (which is approximate) holds for 
small values of F,, and the condition 2F7 > o? + 0-25 (which 
is accurate) for large values of F,. 


(2.3) The Method of Andronov and Witt 


Assuming the oscillator to be synchronized, the solution can 
be represented by eqn. (5), with b, and 6, constant. Hence 
the behaviour of a synchronized oscillator can be represented 
by one point in a plane with axes b,; and b,. From eqns. (7) 
and (8) it is possible to obtain an expression for db,/db, which 
defines the direction of motion of the representative point of 
the oscillator for any values of b; and b,. (The representative 
point, describing the state of the synchronized oscillator, may 
move, for example, when the injection amplitude is increased.) 
In general 


db, ae P(h,, bo) 
db, O(h;, bo) 
and if the values of db,/db, at points in the b,)-plane are 


indicated by ‘flow lines’, a representation similar to that of 
Fig. 3 may be obtained. The point representing the state of 


(14) 


Fig. 3.—Plot of the flow-lines of slope db2/db; in the b;b2-plane of 
the oscillator. 


the oscillator will move along the flow lines in its vicinity, and 
it is possible to place arrows on these flow lines which indicate 
the direction of movement of the point with the passage of time. 
It will normally be found that the representative point tends, 
with time, to a particular point in the b,b,-plane. At such a 
point it is found that the values of P and Q in eqn. (14) are 
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simultaneously zero, and the point is called a singular point of 
the first-order differential equation (14). Such a designation 
also applies, and the condition P = O = 0 holds, at a similar 
point for which the arrows on the flow lines are reversed, leading 
away from the point; such a singular point represents an unstable 
condition. 

Andronov and Witt,°® in the first part of their paper, used the 
fact that a solution of frequency w, corresponds to a singular 
point of eqn. (14) to obtain the solutions (the locations of the 
singular points) and to investigate their stability. 

To determine the location of the singular points (for there are 
usually more than one for a given value of w, and A) P and O 
of eqn. (14) are made zero. In the case under consideration it 
follows from eqns. (7) and (8) that 


db es P(b,, b>) — AWy = abo(1 eas Py) + b, A, (15) 
db,  Q(b,, 52) abs(1 — py) — b Ay , 

By equating P and Q to zero the same expressions as were 
obtained previously by van der Pol’s method result for the 
location of the singular points and also for the output amplitude. 

The determination of the stability of the singular points, and 
hence also of the solutions, of eqn. (15) follows according to the 
criteria of Poincaré’ for the stability of singular points of first- 
order differential equations. Although the conclusions are the 
same as those arrived at by the method of van der Pol, the 
approach of Andronov and Witt to the study of the stability of 
solutions gives a deeper insight into the behaviour of the 
oscillator. 


(3) OSCILLATOR WITH TWO INJECTED SINUSOIDS 


The problem under consideration is the behaviour of a 
synchronized oscillator when the synchronizing signal is accom- 
panied by an interfering signal. Both signals are considered to 
be sinusoidal. The behaviour of the system is described mathe- 
matically by the non-linear differential equation 
dv a 2 : ge aoe 

(16) 


Experiments with an oscillator described approximately by 
eqn. (16) indicated that the only frequencies of any importance 
in the output of the oscillator, as judged by their component 
amplitudes, are w,, w, and the intermodulation product 
2w, — wz. Consequently, following the approach of van der 
Pol, a solution 


v(t) — b(t) sin wt se b(t) COs Wt 
== 53(t) sin Wt ae b4(t) COS Wot (17) 
= b5(t) sin (2w ae w>)t = b¢(t) COs (2w, = W)t 


is assumed, the quantities b,, b>, b3, b4, bs; and be being slowly 
varying functions of time. 

In order to obtain a simple solution to the problem it is most 
important to specify the relative magnitudes of the various 
quantities which appear when the solution (17) is substituted in 
the non-linear differential equation (16). This was done for the 
case of one injected signal (Section 2.2) by grouping the quantities 
under the headings of ‘small’ and ‘very small’. Because of the 
very large number of quantities under consideration in the 
present case, it is convenient to separate them into groups which 
are defined as being small of first, second or third order. Those 
which are small of first order are of the same order of magnitude 
as those classed as ‘small’ in Section 2.2. Those which are small 
of second order would come under the ‘very small’ classification 
of Section 2.2. There is also a group of quantities which are 
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much smaller than those in the ‘very small’ group; these 
quantities are said to be small of third order. 

So that the order of smallness of the various quantities involved 
can be specified, the following four assumptions are made: 


(a) The oscillator is nearly sinusoidal in operation. 

(b) w, and w, are not far removed from w» (<20% deviation, 
for example). 

(c) b3, bg, bs and bg are of an order smaller than J; and bp, 


ie. the interference and intermodulation components of the — 


oscillator output are small. 


(d) The maximum values of b, and 5, are of the same order — 


as a. 


From these four assumptions it is possible to compile a table 
(Table 1) showing the order of smallness of the quantities 
involved in the solution of eqn. (16). 


Table 1 


DEFINITION OF THE ORDER OF SMALLNESS OF SIGNIFICANT 
QUANTITIES IN THE ANALYSIS 


Small of 
first order 


Small of 
second order 


Small of 
third order 


a] wo 


(09 — @1) (Wo — @2) 
@o 7 9 


@o — (2m; — @2) 
@o 

ba bs 

ay ag 


b3 
do 


be 
ao 


by 


by 
A9%0 ay@o 


The assumed solution (17) is substituted in the non-linear 
differential equation (16). Any terms containing components of 
frequencies other than w,, w, or 2w,; — w, are neglected. By 
equating coefficients of the sine and cosine of wy ft, wot and 
(2w, — w )t on each side of the equality sign six equations are 
obtained. 
degree of smallness, and in each equation only those terms which 
are small of the two least orders are retained, i.e. if terms with 
orders of smallness 2, n +1,n+2.. 


and n+ 1 only are retained. The following equations then 
result.* 


. are present, orders n ~ 


2b, -- bo Ay == ab, (1 — Pv = (18) ‘ 
2b, = b,A, => abx(1 — Py) => oe = Awy 
1 
; (19) - 
2b, + b,A, = ab3(1 —=2 Ppa absp, + ADE Pm = (20) 
‘ ese 2 
2b4 aa b3,A, == ab, —— 2/1) as absps =F AbsPm — at Sa Bodg ’ 
2 
(21) 
2b, a beAyn = abs(1 = 2/1) + ab3ps + ab 4 Pm => 0 (22) 
2b — bsAyo — abe(1 —2p,) — abap, + ab3p,, = 0 (23) 


* An idea of the necessity of specifying the order of smallness of the various 


quantities involved in the analysis may be gained from the fact that, after substitutin: 
eqn. (17) in eqn, (16) and neglecting all the trigonometric functions indicated, thera 


are about 200 terms, including first- and second-order derivatives, whereas, if the - 


smaller terms are neglected as mentioned above, the number of terms is reduced to 


50 (before collection and simplification), and th - ivati 
Sreadit Segue CES a ) ere are only first-order derivatives 
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where Ay = 2(w9 — wo), Ay) = 2[wo — 2w, — w»)] (24) 
b? — BB 2byb, 
Ps = a, > m se a (25) 


Comparison of eqns. (18) and (19) with eqns. (7) and (8) 
cespectively shows that they are identical, and indicates that, 
within the accuracy of the assumptions made, the presence of 
a small-amplitude interfering signal has no effect on the response 
of the oscillator to the synchronizing signal of frequency Wy. 

Following van der Pol, a steady state is assumed, so that bj, bo, 
53, b4, bs and be become constants, and 


by b, b; by bs Bg OR (26) 


Substituting eqn. (26) in eqns. (18) and (19) the response of 
rrequency w, is obtained, and is identical with that obtained 
when no interfering signal is present, as seen above. Substitu- 
cion of eqn. (26) in eqns. (20), (21), (22) and (23) leads to four 
algebraic equations. These may be solved to obtain expressions 
(or b3, b4, bs; and bg. As these expressions are rather complicated 
they are given in the Appendix. Here, as with the case of one 
injected sinusoid, the condition (d) of Section 3, that the maxi- 
mum values of b; and b, are of the same order as apo, is removed 
(for the steady-state solutions. The interest, however, is in the 
amplitudes of the output components. The amplitude of the 
component at the synchronizing frequency w, is given by 
sea. (11). The output amplitudes at frequencies w. and 
12), — W have been normalized in the conventional manner 
with respect to ag: 


b2 aie b2 b2 aL b2 
[x= ; a 4, (Us 2 ) 6 (27) 
0 % 


such that p, is a measure of the square of the output amplitude 
of frequency w,, and p; is similarly related to the output of 
frequency 2w, — w,. Using this notation, the expressions for 
02 and p3 are found to be 
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The algebraic manipulations involved in arriving at the result 
are exactly the same; only the theoretical basis of the initial steps 
differs. Thus in van der Pol’s method, putting 6; = 5, =0 
because of the assumed steady-state solution, and in the method 
of Andronov and Witt, putting P = O = 0 because a solution 
corresponds to a singular point of the differential equation (15). 

The above solutions for 63, b4, bs; and bg may be shown to be 
stable both by the method used by van der Pol and by the 
criteria of Poincaré used by Andronoy and Witt. 


(3.1) The Discrimination against the Interfering Signal 


It is not immediately obvious from eqns. (28) and (11) that 
the interfering signal can suffer discrimination, but this may be 
verified by sample calculations and observed experimentally. 
The discrimination D will be defined by the equation 


__ Voltage gain to the synchronizing signal of frequency w, 
Voltage gain to the interfering signal of frequency w, 
(33) 


such that, for discrimination to occur, D must be greater than 
unity. It follows that 


Fy |p 
ad ohh wie ae 
and the substitution of eqns. (11) and (28) in eqn. (34) gives the 
following general expression for the discrimination: 
Die 
[o3 + (1 — 2p)" Joi + 1 —2p1)7] + pt —2pilo1292 + (1 — 2p1)7] 
[oF + 0 — py Yote + C= 2py)7] 


(35) 


Owing to the complexity of this expression, its significance is 
not immediately apparent. The variation of D with the inter- 
ference detuning o, is shown graphically in Fig. 4 for various 


p= rif a7, + (1 — 2p,)? H (28) 
*\[o3 + CU — 29;)"][o7, + A — 2p1)] + pf — 2pf[or2e2 + A — 2p))"] 
p= FH 5 cies 5} Bears Seem dh Oye) 
[o3 + A — 2p, Jlof, + A = 2p1)"] + pf — 2pilor2e2 + (1 — 2p] 
where Fy, = — Baolaga (30) values of o, and p;. With w, = wo, and with infinitesimal 
difference between the interfering and synchronizing frequencies, 
02 = Agfa, 12 = Asafa GBD such that o> = 0, the discrimination decreases as p, (and hence 


Examination of eqn. (28) reveals the most important result that 
the response to the interfering signal is linear, no matter what 
the interfering or synchronizing frequency. This fact makes 
‘calculation of the response very much easier than if the response 
had been non-linear. 

The equations for p, and p3 are complicated. This is only to 
ibe expected, however, for quite an accurate solution of a non- 
linear differential equation such as eqn. (16). A useful simplifi- 
cation occurs if w; = wo, ie. if the synchronizing frequency is 
the same as the natural frequency of the oscillator. In such a 
case the simplifying relations o, = 0 and o,. = — o2 may be 
used. It may then be shown that, as the interference detuning 
c- is increased, i.e. w, moved away from wo, the value of pp is 
asymptotic to the simple curve 


F3 
in 0 irl le ale 
The same expressions for b3, b4, bs and be and hence for p, 
avd p; are obtained by the method of Andronov and Witt, 
VoL. 105, Part C, 


(32) 


the amplitude of the injected synchronizing signal) is increased; 
for example, if py = 1-5 [Fig. 4(@)], the discrimination is 4-8 dB, 
rising to 11-2dB at o, = + 1, whereas, with a larger amplitude 
of injected synchronizing signal, such that p, = 3-0 [Fig. 4(5)], 
the discrimination is 4-1 dB, rising to only 5:3dB at on = +1. 

It should be noted that D is a measure of the discrimination 
against the signal of frequency w, and is not really a measure 
of the discrimination against interference, since the presence of 
the intermodulation frequency 2w, — w in the output may also 
be said to constitute interference. 

At no stage in the solution of the problem has it been assumed 
that the amplitude of the injected interfering signal is smaller 
than that of the injected synchronizing signal, i.e. that B< A, 
although in practice this is very often the case. The only 
assumption made regarding the relative amplitudes of interfering 
and synchronizing signals refers to conditions at the output of 
the oscillator. This fact is most important. It would appear, 
therefore, that if condition (c) of Section 3 is satisfied, i.e. if 
b3, by, bs and be are of an order smaller than 5, and 6, then 
it is possible for the injected interfering signal to be of larger 
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Fig. 4.—Variation of discrimination D with o2 for a number of values 
of o; and py. 


(a) 0, = 0, oy = 1°5 (symmetrical about o2 = 0) 
(b) 01 = 0, ey = 3-0 (symmetrical about a2 = 0) 
(c) 0, = — 1, 91 = 2:0 


Theoretical curve. 
Points marked are experimental. 


amplitude than the injected synchronizing signal, even though 
it is of smaller amplitude in the output, the discrimination being 
due to the selectivity of the tuned circuit together with the non- 
linearity. For the role of the synchronizing signal to be pre- 
served in such a case, the frequency of the interfering signal 
would, of course, have to be sufficiently far removed from w,, 
since the discrimination is a minimum when o;, and oy are 
approximately equal, as Fig. 4(c) shows. If the frequency of 
this larger-amplitude ‘interfering’ signal were made very nearly 
equal to the ‘synchronizing’ signal in frequency, the designations 
‘interfering’ and ‘synchronizing’ would then need to be 
interchanged. 


(4) EXPERIMENTAL VERIFICATION 
(4.1) The Circuit 
The non-linear differential equation (1) was found to be synthe- 
sized quite accurately by the tuned-collector transistor oscillator 
circuit of Fig. 5. In this circuit, limiting is defined by the non- 
linear law of two point-contact diodes connected back-to-back. 


-— 4-5 VOLTS 


Vd 


hy cos wt 


Fig. 5.—The tuned-collector transistor oscillator operating at 10 ke/s. 


Re = 6000, R = 4kQ, Rf = 100kQ, 
L=6-4mH, C = 0:044uF, Cg = 2uF. 


Transistor type OC71, crystal diodes type CG6E, 
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It may be shown that maximum regeneration results for a turns 4 
ratio of very nearly 2 : %y, where ay is the current amplification — 
factor of the transistor; a ratio of 2:1 is used in the experi- | 
mental circuit. The emitter is heavily biased (J, ~ 5mA) and | 
the amplitude of oscillation of v is kept low (<300mV peak-to- 
peak) for two reasons: first to ensure that the non-linearity of | 
the emitter can be neglected, and second so that the non-linear 4 
characteristic of the diode limiting circuit may contain no power 
of v higher than the third. | 
Assuming that over the region of interest the conduction 
characteristic of each diode follows the ideal-diode law | 


= Lew) (36), 


where J and V are the diode forward current and voltage respec- | 
tively, and J, and a are constants, then, if V is sufficiently small, | 
eqn. (36) can be approximated by 


| 


2 3 4 5 
T= Jal +5,V2 + 23 2 Vat qu) . 6D 
If the two diodes are identical, the conduction characteristic — 
iy = f\(vqg) of the two diodes connected back-to-back, as in | 
Fig. 5, can easily be obtained from eqn. (37).* It will contain _ 
only the first, third and fifth powers of vz. To determine the — 
conduction characteristic iy = f(v) of the back-to-back diodes — 
in series with Ry, the relation iy = f,(vg) must first be inverted 
to the form vg = f,(iz). From this expression one can proceed — 
to the characteristic v = f;(iz) by including the resistance Ry. 
A final inversion of v = f3(iz) to ig = f(v) gives the required 
conduction characteristic. It contains only the first, third and 
fifth powers of v, and an examination of the quintic coefficient — 
shows that the latter can be made zero by setting 


Ry, = 1/18al, . 


(38) | 


The conduction characteristic of the limiting circuit then con- — 
tains only linear and cubic terms, and the behaviour of the © 
experimental oscillator circuit is described quite accurately by 


eqn. (1). Under these conditions, the values of F and o are 
found to be given byt . 
ctyRRo 

Se 39 } 

Ry(R = Ro)ag ( ) | 

2CRRo { 

o = ———. (a — w 40) 

(a= Ro) 0 ) ( ) 


where the subscript 1 or 2 is appended to F, o, and w according 
to whether V = A or B respectively. 

The subscript 12 is appended to o when w = 2w, — w9; dg. 
is the amplitude of free oscillation; and Rp is the value of R- 
required just to extinguish oscillations. | 


(4.2) Measurements 


Two point-contact crystal diodes with almost identical charac- _ 
teristics were selected and the required value of R, to give only 
a cubic non-linearity was calculated from eqn. (38) to be 
1130 ohms. Experiments were carried out with the oscillator of. 
Fig. 5, with a natural frequency of about 10kc/s and with one 
injected frequency to ascertain whether this vaiue of Ry resulted 
in the behaviour described by eqn. (3). The method used was to 
approximate a theoretical response curve (p; versus o,) experi- 
mentally, and compare theoretical and experimental values of 
F, and 1 crit, Where 0; yj is the critical value of o, at which 
synchronization ceases. The theoretical and experimental curves 


* Small symbols are used in the remainder of the discussion, representing time-_ 


sets quantities. 
__} The circuit of Fig. 7 is such that, when F is given by eqn. (39), th i i 
indicated in eqns. (8), (19) and (21) do not need to be inde! (62), the apprommiaaa 
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‘were adjusted so that they coincided at o, = 0, and values of 


F; and 0 .i, were obtained for a number of values of ee Mit 
‘was found that the best fit between theoretical and experimental 
values of F, and o; ,,;; occurred for Ry = 900 ohms. For this 
value of R, the calculated F, was about 2% too low and the 
calculated 0; .,;, about 2% too high. An experimental response 
curve, with R, = 900 ohms, is shown in Fig. 6, and is seen to 
lagree with theory quite closely; it was therefore decided to use 
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A 'e, 
INSTABILITY INSTABILITY 
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ig. 6.—Experimental response curve for the oscillator of Fig. 5 with 
Ra = 900 ohms with one injected signal. 

Theoretical value of || = 2-00. 

Experimental value of |F;| = 1-99. 


— Theoretical curve. 
Points marked are experimental. 


the circuit of Fig. 5, with R, = 900 ohms, for the verification of 
any theory pertaining to a system described by eqn. (1). The 
condition for synchronization was investigated experimentally 
and showed reasonable agreement with eqn. (13), as is seen from 
Fig. 7. 


3:0 


FOR SYNCHRONIZATION: 
= | 2F2>02+0-25 
en | 22> 20% 


+250 


OO +100 +150 +200 
CRITICAL EXPERIMENTAL DETUNING, c/s 


or 


Fig. 7.—Verification of the condition for synchronization of the circuit 
of Fig. 5 with Rg = 900 ohms, showing maximum detuning 0} eriz 
for synchronization as a function of |F;|. 


—- Theoretical curve. 
Points marked are experimental. 


Experimental results obtained with both a synchronizing and 
a”, interfering signal are presented in Figs. 4 and 8, showing 
sxvellent agreement with theory. The three graphs of Fig. 4 
are of the discrimination D plotted against the interference 
detuning a, for a number of values of p, and o;. Fig. 8 is a 
3}ot of the output level of the intermodulation frequency, relative 
6 che output of the synchronizing signal, to a base of interference 
de-uning o», the output level of the interfering signal being shown 
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RELATIVE INTERFERENCE LEVEL AT INJECTION 


BELOW THAT OF SYNCHRONIZING SIGNAL, dB 


(e) 1:0 6-0 


2:0 3:0 4:0 5:0 
INTERFERENCE DETUNING o> 


LEVEL OF OUTPUT OF INTERMODULATION FREQUENCY 


=100 =200 =300 
EXPERIMENTAL INTERFERENCE DETUNING, c/s 
Fig. 8.—Variation of the output of the intermodulation frequency 
21 — 2 with o2 for the case 0, = 0, p1 = 1°5. 


— Theoretical] curve (symmetrical about o2 = 0). 
Points marked are experimental. 
—--- Theoretical curve of the relative output level of the interfering signal. 


for reference. The injected amplitude of the interfering signal 
was one-tenth of that of the synchronizing signal in the cases 
considered. The linearity of the response to the interfering 
signal was, in fact, verified experimentally. In this verification 
it was found that the linearity ceased to hold when the output 
amplitude of the interfering signal was greater than 0:4 of the 
output amplitude of the synchronizing signal. At such an 
amplitude of the interfering signal condition (c) of Section 3 is 
violated to quite a large degree. 

It was verified that, if the interference detuning o> is sufficiently 
large, the amplitude of the injected interfering signal can exceed 
that of the synchronizing signal while still allowing discrimination 
to occur. In the experiment p, was set at 1-5, and with w; = wo 
and the interference detuning o, set nearly equal to 4-0, the 
discrimination remained sensibly constant as the level of the 
injected interfering signal was raised from 20dB below to 10dB 
above that of the synchronizing signal. 


(5) CONCLUSIONS 


A solution has been obtained for the non-linear differential 
equation (16), representing the synchronization of an oscillator in 
the presence of interference, and an expression for the discrimina- 
tion against the interfering signal has been derived. The dis- 
crimination is expressed as a continuous function for frequency 
separations ranging from zero (i.e. infinitesimal) to some 20% of 
the natural oscillation frequency. Under the assumptions made, 
the response to the interfering signal is shown to be linear. The 
validity of the method of solution and its accuracy have been 
verified experimentally, very close agreement between measured 
and theoretical results having been obtained. Moreover, the 
assumptions made are considered to be realistic in practice. 

Experimental results also appear to indicate that the simple 
circuit of Fig. 5 is a suitable electrical analogue of the non-linear 
differential equation (1) and may therefore be useful in any 
experimental investigations of systems described by eqn. (1), 
electrical or otherwise. 
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THE ENERGY-INTEGRAL CRITERION OF TRANSIENT STABILITY LIMITS OF 
POWER SYSTEMS 


SUMMARY 


Methods have been devised, in the study of second-order non-linear 
differential equations, for identifying the nature of the phase-plane 
rajectories, without having to find the solutions to the equations. 
hese methods are applied to determine the transient stability limits 
or a power system with two machines and are shown to be equivalent 
o establishing the energy integrals of the system. Formulae are 
derived for the critical switching time taking resistance into account. 
The methods are then generalized for multi-machine systems both with 
and without an infinite busbar, and the energy integrals are given. 
‘he application of these integrals, in conjunction with step-by-step 
integration, to find the critical switching time for a fault is described. 
An example of a three-machine system solved by the conventional and 
by the new methods is given. 


LIST OF SYMBOLS 


@ = Rotor angle, rad. 
@ =v, velocity of slip with respect to synchronous speed, 
rad/s or rad/(generalized time unit). 
mie— Lime, sec. 
7 = Generalized time. 
B = Generalized mechanical torque. 


Ty, = Mechanical torque per unit of the rth machine. 
_T,,, = Electrical torque per unit of the rth machine. 
M, = Inertia constant, kW-sec?/kVA radians of the rth 
machine. 
V, = Voltage per unit behind transient reactance of the rth 
machine. 


Z,, = Driving-point impedance. 
_Z>» = Transfer impedance. 
fpr = Complement of impedance angle. 


(1) INTRODUCTION 


When a power system is being designed it is usual to attempt 
o establish by means of analytical procedures that the system 
nas adequate transient stability margins. The method of doing 
this, which has been well described in the literature,! amounts to 
the solution of the simplified electro-dynamical differential equa- 
iions of the system by means of numerical step-by-step integra- 
(ion. The solutions obtained are in terms of the machine rotor 
angles as functions of time. When the relative rotor angles 
appear to be increasing indefinitely, the system is said to be 
unstable, and when they appear to be going towards zero the 
wystem is said to be stable. When there are more than two 
machines there is, at present, no definite criterion for stability. 
ip this paper, a new approach has been made to the subject, 
oeed upon the methods used in the study of non-linear mech- 
wnics. The electro-dynamical equations of a power system are 
aea-linear, but by finding the singular points of the differential 
»g@uations, and obtaining an energy integral, a definite criterion 
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for stability may be obtained. This appears as a generalization 
of the well-known equal-angle criterion for two machines, but 
in order to obtain solutions in systems with many machines 
certain approximations are necessary. 

The use of the energy-integral method has several advantages. 
First, a definite value is obtained for the time in which a fault 
must be cleared by carrying out only one forward integration. 
Secondly, the stability or otherwise of the system can be deter- 
mined mathematically, instead of attempting to assess the general 
nature of the solution from the slope of the rotor-angle/time 
curves. Perhaps the most important fact is that the new 
approach provides a basis for future theoretical work in a field 
where empirical methods have hitherto been the rule. The 
application of these methods to a two-machine case is first 
developed, in order to provide a simple basis for extension of the 
ideas to systems with many machines. 


(2) TRANSIENT STABILITY, TWO-MACHINE CASE 
(2.1) Nature of Problem 


Transient stability problems involve consideration not of the 
solutions of the differential equations for a system, but of the 
general nature of these solutions. The problem is thus par- 
ticularly suitable for investigation by topological methods 
developed in the field of non-linear mechanics.23 Results 
for second-order non-linear differential equations are well 
established, and the two-machine system, subject to the usual 
assumption of constant flux leakages in the machines, is charac- 
terized by equations of this type. 

The differential equations for any system with no more than 
two synchronous machines can be reduced to the form 


6 = B—sin®@ . Ste SRC) 
(see Appendix 7.1) 
where differentiation is carried out with respect to a generalized 
time variable. 

Introducing the slip velocity v, this may be written 


dv 
— = B—sin® Zz 
v 70 B—sin (2) 
and integrating, 
4) 
=z = BO + cos 8 — By — cos A ei el) 


when the initial conditions are 


v=0,0=% 


(2.2) Phase-Plane Trajectories 


Eqn. (3) expresses the law of conservation of energy for 
the system, that kinetic energy, v?/2, and potential energy 
(—BO — cos 6) are together equal to a constant (—B8y — cos 4p). 

In the nomenclature adopted by writers in the field of non- 
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linear mechanics, the plane of the variables v, Q, is the phase 
plane, and a curve corresponding to eqn. (3) is a phase-plane 
trajectory. 

It will be noticed that the phase-plane trajectories, as the name 
implies, are not solutions to the differential equation (1), since 
time does not appear as a variable. A trajectory Is the path of 
a representative point. There is an infinity of solutions corre- 
sponding to different time origins for a given trajectory. The 
motion of a representative point along a trajectory corresponds 
to one of the solutions. . 

Two types of trajectories can be obtained, normal trajectories 
and degenerate trajectories, or singular points. For all normal 
trajectories dv/d0 is determinate, but for a singular point dv|do 
is indeterminate. Thus the singular points for eqn. (2) occur at 


sinO=0. (4) 


Ve— 0B 


as can be seen from eqn. (2). 
The singular points correspond to points of stable and unstable 


equilibrium. If, 


B > 1, there are no singular points. 

B =1, there is one singular point at 0 = 7/2. 

B <1, there are two singular points, P,; and P, given by 
6, = arc sin B and 0) = 7 — arcsin B. 


Poincaré+ has shown how to classify the singular points of 
both conservative and dissipative systems of the second order. 
In this case 6, is a point of stable equilibrium, a vortex point, 
and @, is a point of unstable equilibrium, a saddle point. 

The nature of the phase-place trajectories for B < 1 is shown 
in Fig. 1. These trajectories are obtained from eqn. (3), each 


Vv B<! 


NON. OSCILLATORY TRAJECTORIES 


OSCILLATORY TRAJECTORIES 


Fig. 1.—Phase-plane trajectories for two-machine case. 


trajectory corresponding to different values of 65. Three kinds 
of trajectory can be distinguished; those around the point of 
stable equilibrium, P;, which are oscillatory; those approaching 
the point of unstable equilibrium, P,, which are non-oscillatory ; 
and a single trajectory which enters P,. This last trajectory 
divides the region of oscillatory trajectories from the region of 
non-oscillatory trajectories, and is called the separatrix. 


(2.3) The Critical Switching-Time Problem 
(2.3.1) The Critical Switching-Angle. 

The critical switching-angle problem is the one which occurs 
most frequently. It arises as follows: 

‘A power system is in a condition of stable equilibrium and the 
accelerations are zero. A fault occurs on the electrical system 
changing the parameters of the differential equation. In 
practical cases the electrical power output is reduced, and with 
the mechanical power input remaining the same, the machines 
will accelerate. After a certain interval of time, the fault is 
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cleared from the system and the differential equation has a 


third set of parameters.’ 

This is the system whose stability must be investigated. The 
effect of the fault is to give the machine a certain velocity and 
angle when projected into the final unfaulted condition. tae 

The problem is illustrated in Fig. 2. Initially the velocity is 
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Fig. 2.—Critical switching angle. 


zero, and the system parameters are such that B= By. The 


initial angle is therefore 


6) = arc sin By 


(5) 


There is then a fault and B becomes larger, changing from Bp to * 
B,, where B, is usually greater than unity. The representative | 
point follows the phase trajectory for the faulted system OCD. — 
The fault is now cleared and B becomes B, less than unity. Ifthe ~ 


4 
J 
{ 
} 
{ 


| 
| 

; 
| 


4 


representative point is on the portion of the trajectory OC at ; 
this time, it will follow an oscillatory trajectory, but if it is on — 
the portion CD, a non-oscillatory trajectory. If it is at C at — 


the clearing time, it will follow the separatrix to the saddle 
point P,, which, however, it will not reach in a finite time. The 


wel ay 


angle @, corresponding to a point C is the critical switching — 


angle. 
(By — CB,)0@, + cos 8, — C cos (8, — ¢,) 
= By — CB. — O1¢ + $) + cos Oy + Coos 64, 


An expression for this angle, derived in Appendix 7.1, is : 


6) 


This equation cannot be directly solved in the general case, but © 


often the quantities (By — CB,) and ¢, will be small, and an — 
iterative method may be used to find a good approximate solu- — 
tion quickly. It is shown in Appendix 7.1 that, for certain cases, i 


eqn. (6) reduces to the expression 


(2.3.2) Approximate Formulae for the Critical Switching Time 


(1 — C) cos 8, = cos Oy + Ccos 61, — B,[7 — (8) + 9;.)]. 


which corresponds to that given in Reference 1 for the case — 
where there is no resistance. 

The critical switching angle has been found, but it is the critical - 
switching time which is required. This is the time required for 


the representative point to traverse the trajectory OCD in Fig. 2 
from O to C. 


The differential equation of the system under fault conditions _ 


is given by 


d,0 


G2 — By — sin @ 


(8) 
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It is required to find the solution of r = F(6) and to find the 
value T, corresponding to 


7.= FO) —FO@) .-. .. . » (9) 
where 0, is determined by eqn. (6). 


Oe 
Also =. =| a (10) 
Q ¥ 
Integrating eqn. (8) once and substituting in eqn. (10) 
tenes d 
“tp (11) 


~ 4/240, +/(B/O + cos 0 — BO, — cos 0p) 


The expression cannot be integrated, except for the case where 
1B = 0, when an expression in terms of elliptic integrals is 
‘obtained. In general, either an approximate, numerical or 
‘mechanical method of integration must be used. Numerical 
step-by-step integration is well known, and the method that will 
be presented here consists in replacing the expression under the 
root sign in eqn. (11) by another expression which can be 
integrated. 

In practical problems, the range of integration 0p to 0, is less 
than 7, and over this range the cos @ term may be expanded as a 
power series in 0. 

In addition, in almost all generator problems 0 > 6) > gf ‘ 
and if 7, is not to be infinite, B,@, must be greater than 0-725. 

This means that the cos @ term may be replaced by the series 
iterminated at 62, and this will give a sufficiently accurate solution 
im practical cases. Ifa higher degree of accuracy is required the 
\series may be carried to the terms of the third and fourth power, 
when solutions in terms of elliptic integrals will be obtained. 

_ The details of the quadratic approximation are given in 
ppendix 7.1, and the formula for the switching time is found 
(to be 


2 
Te a/=[are cos +/R/(R + 6,,)] (12) 
here bn, = (8. — o)/2 
Pa. sin (6,, + 6) sin 8, — eae (8,12 + 8) sin 6,,/2 (13) 
_ BA, — sin O,, + 9) sin 6, 
and R a6, 


Eqns. (6), (12) and (13) enable the critical switching time to be 
calculated for any two-machine system where there is only one 
change of system connections following the fault. 


‘2.3.3) Use of Step-by-Step Integration in Conjunction with the Energy 
Integral. 

The method described above of finding 7, is not easily applied 
hen there are two or more system changes subsequent to fault. 
system studies, these conditions arise when there is sequen- 
rial clearance of circuit-breakers, or reclosure following fault 
clearance. In order that these problems may be investigated 
easily, a new method of deciding the stability status of a trajectory 
's introduced here. and it will later be generalized for systems 
sataining more than two machines. It has been shown in Sec- 
cion 2.2 that, if the representative point is within the area bounded 
oy the separatrix at the instant of switching, it will follow a 
stable trajectory, and if outside, an unstable trajectory. If the 
representative point is on the boundary at the time of switching, 
th’s is the critical switching time. 

fhe separatrix represents a particular solution of the dif- 
Ye-ential equation in the phase-plane, and this solution can be 
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expressed as an energy relationship. The differential equation 
for the unfaulted system can be written 


M6 = Tine — Te: Sin (0 + $,) . (14) 


The reduction of the general two-machine system to this form is 
shown in Appendix 7.1. 
The saddle point is at 


Oie = 7 — arc sin(T cl Tne) — be = 7 — O1¢ — de 
Multiply eqn. (14) by 6, and integrating with @ and 61, as 
limits, 
Mv?/2 = Tc + Te, cos (8 + $,) 
=A ye peer ee COSU Ts 
0; =. are sin (7.,./T4,) 


(15) 
where 


The energy equation (15) is satisfied on the separatrix, and the 
corresponding energy curve is given in Fig. 3. 
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Fig. 3.—Energy curve. 


Writing eqn. (15) in a new form with an arbitrary constant 4 


2 
a — [T,c9 + To, cos (8 + ¢,) 


om Tat 2 B1 -_ oe) =f Te COs G16] =A (16) 


The instant of changing to the unfaulted system will be charac- 
terized by certain definite value of v and 0, and if these are 
substituted in eqn. (16) a value for A will be obtained. 

If A is zero, eqn. (15) is satisfied, and this must be the critical 
switching time. Since the kinetic energy term is always positive 
it is clear that A must take negative to positive values when 
passing from the stable to the unstable region. The criterion of 
stability is thus given by 


A < 0—stable 
A = 0—critical 
A > 0—unstable 


The general method of solving a two-machine transient 
stability problem is, for the final unfaulted system, as follows: 


(a) Find the saddle point. 

(b) Establish the energy equation, giving a zero of energy (a 
minimum) at the saddle point. 

(c) Substitute appropriate values of v and @ from the faulted 
system and test for stability by eqn. (16). 


The use of the energy integral in this way can greatly reduce 
the amount of step-by-step integration necessary, since a solu- 
tion need be carried on only until the instant corresponding to 
the critical switching time. In addition, a definite criterion for 
stability is available, and this means that only one forward 
integration need be carried out to obtain a result which is 
definite. 

Most transient stability problems involve more than two 
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machines. In the next Section, therefore, this method will be 
generalized for multi-machine systems. 


(3) TRANSIENT-STABILITY MULTI-MACHINE SYSTEMS 
(3.1) General 


In testing a power system for transient stability, either by 
using a network analyser or by calculation, a fault is assumed of 
a certain duration, and the motion of the machine is calculated 
by step-by-step integration forward in time. In general, the 
mechanical powers are fixed prior to the fault and cannot satisfy 
the condition that the accelerations become zero. Thus thesystem 
is said to be stable if the relative accelerations appear to be 
tending to zero. There has been no precise criterion available 
such as was obtained for the two-machine problem, and conclu- 
sions concerning stability have been obtained by judgment and 
experience. Certainly if two step-by-step integrations are carried 
out for different fault durations, two fault times can be found, 
such that the relative machine angles, during the first 360° in one 
case, appear to go on to small values, and in the other, to go to 
continuously increasing values. In the two-machine problem it 
has been shown that the phase-plane is divided into a zone 
of stable and unstable trajectories for certain values of a 
parameter B. The terms ‘stable’ and ‘unstable’ refer to the 
type of trajectory from the point of view of the synchronous 
operation of the power system—both trajectories are stable in 
the mathematical sense. It appears likely that some similar 
division exists for the 2(n — 1) dimensional phase space of an 
n-machine system. Indeed, the fact that the transient-stability 
problem is posed at all presupposes such a division of the phase 
space into stable and unstable zones. It would be expected that 
the surface of division, the separatrix, in the phase space passes 
through the saddle point or rather a point analogous to the 
saddle point of the two-machine system. If the integral can be 
found to give the equation of the separatrix, this can be used to 
test for stability by the method given in Section 2.3.3. 

The extension of these concepts to multi-machine systems is 
beset by many difficulties. There exists no general mathematical 
background such as has been provided by Poincaré and other 
workers for two-variable systems. The simple ideas cannot 
merely be used without development since multi-machine systems 
produce new phenomena not existing in two-machine system. 
Three difficulties arise: first, it is difficult to find the singular 
points; secondly, many singular points exist, and a special one 
of these must be identified; thirdly, it is only possible to carry 
out an exact integration where there is no resistance in the 
system. The difficulties can be met in various ways, and it will 
be shown that practical use can be made of these new energy 
methods. 


(3.2) The Energy Integral 
(3.2.1) Systems with Finite Inertia. 
The differential equations for an n-machine system may be 
written in the form 


M0 SST. = tae Eee Emel fT) 


where there are m equations, with values of r from 1 to n. 


p=r—1 p=n 
Now Ter >" >» Sip Srp 2 x Sip (18) 
p=1 p=r+1 
EV, 
where Sip = a ? sin (6, — 0, — dy») (19) 


where p takes values from 1 to n. 
A set of n(n — 1)/2 relative acceleration equations can be 


established from the set of m eqns. (17) by combining every 
possible pair of equations. 
Consider for example the rth and pth equations of the n set. 
Multiply the rth equation by M, and the pth equation by M, 
and subtract, 
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M,M,(6, — §,) = MyT me — M,Tmp — MpTer + MT 


I 
Now n(n — 1)/2 equations of this type can be be | 
each equation is multiplied by the integration factor for the am 
hand side a and the whole set is added together, an integrated — 
expression can be found of the form, 


"SS ee SS ee 
r=1 p= r+ly SM. r=1 p=r-+1 >> M. 
r=1 r= . j 
® -=n-1 p=n p=n E,E, 
+ Dies | M, >} = P sin 6, — 0, —@,,) : 
0 r=lp=rtl p=1 rp 


EMS 


r=1 


— ere (0, — 0, — $0) Oar . QD 


It should be noted that the terms p = r in the integral produce 
constant terms. These should be written with a minus sign. It — 
will be found in practice that part of this integral can be found — 
directly, ice. where the p’s and r’s in one sine term equal the r’s — 
and p’s in the other. 

There are n(n — 1)? terms under the integral sign of the right- 
hand side of the equation, and of these, n(n — 1) terms can be 
exactly integrated, i.e. 1/(z — 1) of the whole number of terms. 
Thus for two machines all the terms may be integrated, for three, 
one half of the terms, for four, one third of the terms, and so on. 
In fact, the situation is rather more favourable than this, because 
of the terms that cannot be integrated exactly, each pair of On 
and 0,0 form the equivalent to a single term of the kind that can 
be integrated. Thus, for three machines, there are 12 terms, of 
which six can be exactly integrated and the remaining six terms 
become three terms if certain of the Ppr terms are neglected; thus 
the two-thirds of the final terms can be directly integrated without 
approximation. In general 2/n of the terms of the final expres- 
sion can be integrated exactly, and the remainder approximately. 

The integral can be found directly, and eqn. (21) can be much ~ 
simplified if all the small angles Ppr are neglected. If at the same 
time the values of the mechanical power T,,,, are changed so that 
eqn. (20) is satisfied at the critical point, "the integral will be | 
correct at this point. It is not thought that this assumption will 
produce large errors in the integral since each of the angles Por 
appears also as ¢,,(= — ¢,,). | 

As an example, the integral for a three-machine system is given — 
in Appendix 7.2. 

For all Por = 0 the corresponding general expression is 
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r=n—1 p=n 


Vin 


a >» eZ Z, [sin Onn — Ic) + 08 8, — cos Ochi 
(22) 
(3.2.2) System with an Infinite Inertia. . 


If the velocity of one of the machines in the system is fixed, 
absolute, rather than relative accelerations determine the stability tl 
criteria. In practice, this arises when a small power system is 
connected to a large power system. The machines in the large 
system can be assumed to have infinite inertia, or at certain 
points there is assumed to be an infinite busbar. Under this 
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circumstance the general expression (22), derived when all the 
‘dp Were assumed to be zero and the mechanical powers are 
changed to satisfy eqn. (17), reduces to 


s M,@2 
| r=1 2 


r=n—1 p=n 


a>) oe [sin 8, n(8,» — P,n-) + cos 6,, — cos 8 


= ep aeed oi Le 


(23) 


(3.3) Singular Points of the Differential Equations for 
Multi-Machine Systems 


The singular points are obtained by equating eqn. (20) to zero, 
sand then solving the equations for the (nm — 1) independent 0 
\ variables. 

In order to employ the energy integrals usefully it is necessary 
| to find, and generally classify, the singular points. This has been 
‘done for the equivalent of a two-machine system but higher-order 
ssystems have been little explored. In these systems a large 
‘number of singular points may be found. A three-machine 
‘system with no resistance, for example, has been found to have 
aS Many as six singular points for certain values of the parameters. 
Only the singular point which displays stable vortex properties 
for all variables will give a point of stable equilibrium for the 
(power system. The other singular points will display saddle-type 
‘or unstable vortex-type properties. It is a particular saddle-type 
point which is required in order to find the constant for the 
energy integral. 


(3.3.1) Finding the Singular Points. 

The sitigular points can be found by graphical methods for a 
three-machine system or by using a network analyser for all 
systems. 

The problem is to solve the simultaneous eqns. (20) when the 
‘left-hand sides are zero. The network analyser is simple to use 
ywhere the absolute accelerations are to be zero because then 
T., = Tm. However, where the relative accelerations are 
required to be zero, the network analyser must be balanced for 
some absolute acceleration a, which is constant for all machines 
but different for each singular point, so that for the rth machine, 


Ly a ies 
M, 


An example given later shows that this can be done with 
reasonable accuracy on a network analyser for a three-machine 
system. For systems with more machines, some modification 
could be made to the analyser to facilitate the work. Where 
ithere is an infinite busbar in the system, eqn. (17) applies, a = 0, 
and balancing becomes a simple matter. 


\(3.3.2) Identification of Singular Points. 

There exists a large literature on the classification of singular 
‘points in the sense that this reduces to the problem of finding 
ithe extrema of the potential-energy expression and distinguishing 
ithe various types of extrema (maximum, minimum and saddle- 
‘type). In practice, it is required not so much to distinguish 
between maximum, minimum and saddle points, but to decide 
which of a number of saddle points should be used to define the 
‘constant of integration of the energy integral. This can only 
be settled by considering the topology of the phase-plane 
‘trajectories in a particular case. To explore the whole phase- 
ypiane (or space) is a large undertaking, and it is suggested that 
ithe following practical rules will enable the correct saddle point 
© be found. 

(a) Set the equations, and select the variables so that the 


=2 


Tny’S are, as far as possible, positive. In a given case some of 
the T,,,,’s may be negative, but the sense of the variables should 
be selected so that the equations reduce predominantly to the 
form in which T,,,, is positive and T, is negative. It is clearly 
necessary for stability to be possible, for the mechanical torques 
to be a driving force, and the electrical torques to be a resisting 
force. 

(b) The saddle point to be selected is that which gives a 
maximum angle in the positive direction of the variables. 

It is appreciated that the selection of the correct saddle point 
may be difficult in some cases. It is considered that some more 
definite criterion can be developed if further work is undertaken. 


(3.4) Energy Integral in Transient Stability Studies 


The method is a generalization of that already presented 
for two machines in Section 2.3.3. The energy integral for 
the n-machine case as developed in Section 3.2 specifies a 
2(n — 1)-degree surface. If this surface passes through a saddle 
point it will, under certain conditions, separate the regions of 
stable and unstable trajectories in the phase space, i.e. it will be 
the separatrix. 

The transient-stability problem consists in determining the 
position of the representative point of the system with respect 
to the separatrix. It is required to find whether the representa- 
tive point is in the region of stable or unstable trajectories, i.e. 
whether it is inside or outside the separatrix. This is not 
difficult to determine since the kinetic-energy term of the energy 
integral is always positive. 

The separatrix is given by eqn. (22), where 0 
the co-ordinates of the saddle point. 

Consider a system subject to a disturbance. At some time ?, 
the system velocities and angles are given by @,,,, 9,5, and the 
system has been changed to the configuration used to determine 
eqn. (22) above. 

Substituting the velocity and angle in eqn. (22) and rewriting 
the equation in a new form with a constant A on the right-hand 
side, 


pc then represents 


r=n—1 p=n 


M,M, 


St ts rl rg ane 6 6 
= » : x a [sin rock gee) ae kas + cos TpO cos pe | 
r= =r rp 


=A nA) 
If A < 0 the system will be stable, 
A = 0 the system will be critical, 
A > 0 the system will be unstable. 
The particular form of the integral used is immaterial. The 


form based upon absolute accelerations could equally well have 
been used if the system had an infinite busbar, and a rather 
simpler expression would have been obtained. 


(3.5) Example of the Use of the Energy Integral 


The example is of a three-machine transient-stability study, 
and the system to be studied with its initial conditions, as set 
up on a network analyser, is shown in Fig. 4(a). In order to 
check the method, the problem was first solved by step-by-step 
integration in the conventional fashion. The swing curve for a 
fault clearance time of 0-15sec and reclosure of the line at 
0: 325sec is given in Fig. 4(b). This appears to be just unstable. 
The results of a swing calculation for a fault clearing time of 
0-15sec and reclosure in 0-30sec appeared to give a stable 
condition. 
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Fig. 4.—Three-machine problem. 


(a) Initial conditions. 
(6) Swing curves. 


The following procedure must be adopted in order to use the 
energy-integral method of solving the critical switching-time 
problem. 


(a) Set up the post-fault system. In this case, it happens to 
be the same as the pre-fault system, since the faulted line is first 
switched out, and then reconnected after an interval of time. 

(6) Find the transfer and driving-point impedances, and the 
generator voltages ‘behind transient reactance’. 

(c) Set up the integral, using the expression based upon relative 
accelerations. 

(d) Find the singular points, in this case using the criterion of 
zero relative accelerations. 

(e) Select the saddle-type point, which gives large angles (around 
m) for those variables, which have positive resultant mechanical 
powers, or the least negative mechanical powers. 


The generator equations for this system, obtained as above, 
are, on a 100 MVA base, 


18 5 

750 1 =0-72 —0°577sin (612 + 8°) —1-032sin (0,3 — 8) 
0-866, Y 

3°, 0-17 -+1-032sin(9,5-+8) +0-90sin(A,,—1 

750 3 13 sin(023 — 10) 


The equivalent mechanical powers for the relative acceleration 
equations will be 


Trip 0860010: 72.—-18 x (0-26) = 5<305 
Tigi On 8 < O17 == 0-54 
To; = 3-5(—0-26) — 0-866 x 0-27 = — 1-06 
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Thus the singular point which must be found is 
in the region where (15, 913, 923) is (7, 7, 0) ae 

For the zero relative-acceleration criterion, it 1s 
necessary to balance the network analyser so that 
6, = 6, = 6; =a, some unknown constant, which 
will be positive. d 

In the study of the network analyser the quanti- 


P=170 
Q=78 


E; + '-l65 ties measured on the analyser are the angles and ' 
Ae: “155 electrical powers for each machine. The mechanical 
ve: 3-5 powers are known; balance is secured when the 
quantities 
i ae ay TS Ten he T.3 are equal. 
A, H, A 

In this study, which was first carried out by this 
method, this was found to be difficult. However, 


the provision of a special wattmeter which would 
enable the values (T,,; — Te1)/Hj, etc., to be read 
off directly would greatly assist in obtaining balance. 

The solution found in the neighbourhood of 
O12 = T7, 0,3 = 7 Was D126 = 193% O13¢ cs 167°, and 
hence 8,3. = — 26°. 


the transfer impedances are assumed to be zero, 

and the mechanical powers are modified so that 

the integral remains zero at the saddle point, the 

expression to be used is that given in eqn. (23). 
Substituting the values, we have 


“= 


x50 9762 + 2-8262, + 0-13663,) 


= (0:3616,5 = 0:0516,; + 0-557 cos 615 
-+ 1-032 cos 0,3 0:90 cos #,3+ — 0-329) = A 


From the step-by-step integration of the system under fault 
conditions, the velocities and angles are found to be 


Stable 
Unstable 


By linear interpolation, the critical switching time is 

0:55 

0-30 + —— 

at 0:85 

In the actual study undertaken by conventional methods it 

was concluded that 0:325sec was the critical switching time. 

There is thus no appreciable difference in the time found by 

each method. 

The positions of the singular points were also found by plotting 

the two independent relative acceleration equations obtained 
from eqn. (25). These are shown in Fig. 5. 


x 0:05 = 0-33sec 


If the small angles due to the resistive part of . 
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Fig. 5.—Three-machine problem. 


Relative acceleration equations. 


The four roots are 


The last three roots give unstable trajectories and the singular 
point of interest is (ii). It will be noted that the values are 
3-5 % different from those obtained by using the network analyser. 

Again, substituting the angles and velocities from above, we 
have 


—0-54 
0-31 


Stable 
Unstable 


By linear interpolation, the critical switching time is found to 


0-54 
be 0-3 a 0-055 O23 5scc: 
ae 0°85 
Thus the inaccuracy in the determination of the angles on the 
network analyser did not affect the results. 


(4) CONCLUSIONS 

In the paper, the author has attempted to give a new orienta- 
tion to the study of power-system transient stability limits by 
introducing the concept of energy integrals and singular points 
used in the general study of non-linear differential equations. 
In Section 2.3 these methods are applied to the two-machine 
stability problem, and an equation is given for the critical 
switching angle, which, for the first time, includes the effect of 
resistance. An approximate, but sufficiently accurate, expres- 
sion for the critical time is also derived, and the two equations 
taken together constitute a complete solution for the two- 
machine case. 

The carrying out of a transient-stability study by the established 
methods® where there are more than two machines requires a 
large expenditure of effort. The actual process of forward 
integration, using the electrical powers obtained from a network 
analyser, is lengthy. In addition, a number of forward integra- 
tions with different clearing times—two is the minimum—must 
be carried out in order to establish the critical time. The 
establishment of the energy integral may require some effort, 
but a definite value for critical switching time is obtained with 
only one short forward integration. If there are many types 
and situations of fault to be studied which result in a common 
unfaulted system, as would be the case in a study of auto- 
reclosing, the same integral may be used in each case. 

It is true that some approximations are necessary in order to 
establish the integral, but the nature of these is such that they 
will partly cancel out in the integral, and have little effect on the 
final result. 
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So far only three-machine problems have been studied in 
detail, but a surprisingly large range of problems can be reduced 
to this without a great loss in accuracy. The method is, however, 
capable of development and this work is now being undertaken 
for systems with a larger number of machines. It is thought 
that the energy-integral method can be applied easily when a 
digital computer is used for transient stability studies. Once the 
correct value of the constant for the energy integral is established 
by finding the co-ordinates of the important saddle point, the 
value of the integral can be found at each step by appropriate 
operations on the instantaneous velocities and angles which are 
naturally available during the process of forward integration. 
When the value of the integral is zero, the critical time can be 
printed out by the computer, since this is the only information 
actually required. 

The development of methods for investigating non-linear 
systems with many variables is important in many fields of 
engineering, and it is hoped that the paper will make some 
contribution to this subject. 
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(7) APPENDICES 


(7.1) Derivation of Critical Clearing-Time Formulae for the 
Two-Machine Case 


(7.1.1) Critical Clearing Angle. 


The general expression for a two-machine system may be 
written 


rn V2. VAN Z 
AGO Be 3 — sin ——1 2 sin (6,5 — 
Sicha ap Pit 7. (912 — 42) aA 
: V2 VV. 
M6, =T.. — — sin Bee ei (nae 
202 m— 7 br2 Ze (O12 + 42) 
This may be reduced to a relative acceleration equation 
M2612 = Tz — Te Sin Bip . (27) 
M,M 
where M,, = ae 
12 M, + ™; 
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ee s Vi. mM 
(Tm — z'sin bir) M2 (Tn Fone 1 


m12 — M, ae M, 
M, ) (28) 
= = —_—______—'f 
O12 = O12 + $i23 P12 = are tan Gas Ne an $4» | 
Sn ae all 1 4MiMp sin? ay | 
el2mre Zin (M, + M,)? 


Eqn. (27) may be easily reduced to 


d*0 
“dr 


Re Tint (2) ex ss OF 


Te12” 
a r= fg 


From eqn. (29) it is shown in Section 2.2. that 


= B—sin®@ 


where 


2 
D3 = BO + cos 0 — B6y — cos I 
for the condition v = 0 at 0 = %. 

The expression for the critical switching angle 6. can now be 
derived. 

For the faulted system, 


v2 
2 


= B,0 + cos @ — By — cos A 


For the final system, the electrical parameters are changed, and | 
(33) 


M2912 = Tire — Terre SiN 91 ¢ 
which can be reduced to 
d70 


ime 


—sin(@ — ¢,)]. 


where 


T, 
Sy rae 9 =61. + Piogs Piop=are tan Geers 
1 


* tan diay) 


(35) 
Fate Blt age | 
$c = (Pio — F120) C= oe: 
el2f 


Where the suffix fis added, the parameters refer to the faulted — 
system. 


Now eqn. (34) may be integrated as before, giving 


a b.)] - 


It is required to find the point C given in Fig. 2, the intersection 
of the faulted system trajectory with the separatrix in the final 
system. 


Eqn. (36) will give the separatrix if 


= C[B,O + cos (0 — $,) — B.09 — cos (Oy (36) 


0) = 7 —arcsinB, + ¢, . (37) 


(29) 


G1) 
(32) 


(34) 


ee ee ee en, eee ae Le MCR 5 ey ie po ei a: 
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|e. 0) = 7 — 0, + ,, where 6;, is the angle of the vortex 
|point. Substituting for 6 in eqn. (36) gives 
1 v2 

i C[B.8 + cos (0 — d,) — Bm — 6; + $,) +. cos 6,.] (38) 


The critical switching angle @, will correspond to the inter- 
‘section of the trajectories given by eqns. (32) and (38). 

Substituting 6. for @ in each, and equating and rearranging 
‘ gives 


(By — CB,)0, + cos 8, — C cos (8, — $,) 


ia ByOq — CB(a — 6,;, + $,) + cos % + Ccos A, (39) 
1 Now if P12 — Pir then Le = 0) 
(and Tonite = £mi12f 
\ whence a Tni2e ie Te12F x ide: Pe. 


‘ Leioe T e126 Te12F Cc 
| Under these conditions eqn. (39) reduces to 


((1 — C) cos 8, = cos 0) + Ccos O,, — Bela — (Oy + 9,.)] (40) 


(7.1.2) Derivation of Approximate Formula for Critical Clearing Time. 
it is proposed to replace the expression under the root sign in 
ean. (11) by a quadratic in 6. The expression can then be 
i integrated in terms of inverse trigonometrical functions. 
It is useful to change the variable so that the limits of integra- 
tion are +6,,, where 


The expression under the root sign in eqn. (11) becomes 
1 F,(@) = B,8 + cos (8,, + 8) cos 8 
— sin (6, + 99) sin 8 + BO, (41) 


If a quadratic function F,(6) is used to replace F,(@) in the 
«equation in such a way that the functions have equal value at 
—6,,, 0 and 6,, and if 


F,(8) ae [rn si R? a (42) 


‘then a and R will have the values given in eqn. (13). The 
(critical clearing time may be found by replacing F,(@) by F,(8) 
iin eqn. (11) and then integrating, giving eqn. (12). 


— cos Oy 


(0, — R)*]a 


(7.2) The Energy Integral for the Three-Machine Case 
The differential equations are 


Es Vee A 
| M,o; = Tani Z,, sin dit —— Za sin (O15 = $12) 
Vives 
— 13 sin (613 — $13) 
3 
; Ve. VoV,.. 
M2, = Tr — Z, sin p22 + Zs sin (O42 + $12) | 
Ki . (43) 
VoV. 


$23) 


z i sin (8,3 — 


; V2. V3V,. 
i 14,0; = Ty3 — =* sin $33 + Za sin (9,3 + $43) 


233 


VV sin (823 + $23) 
3 


J 
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These equations arise from the general expression when 


6; — 6, = 615 =>— 051, etc. 

The relative acceleration equations for every possible com- 
bination of the machines [i.e. n(z — 1)/2 equations] must be 
established. Each equation must then be multiplied by the 
appropriate relative velocity, and the resulting equations added 
together. For three machines, this gives 


M,M76 1202 + MyM36>3023 + M,M36,36,3 


= Tyy12612 + Tm23923 + Tn 931 
met ole 3 : 
: 3 lM, sin (O12 — $42) + My sin (O12 + $42)]O12 
V. : : : 
75 Ms sin (023 — $23) + M2 sin (0,3 + $23) ]A23 
V, : : ; 
7, Ms sin (0,3 — $43) + M, sin O13 + $43) ]813 
VV. . : ; 
oa + M3 [sin (012 — $42)813 — sin O42 + $42)623] 
me V. 
7 me [sin (823 + $23)013 — sin (823 — $23)942] 
VV: ; ; P , 
= os ee [sin (013 — $13)042 — sin (613 + $13)623] (44) 
where 
V2. 
Ti = Mz (Tw a Zz, sin br) 
V2 
= My (Ta = Ze sin $22) 
Tni3 M;( re sin bu) as 
re M, (Ta — sin bss) 
V2 
| Ba ros M,( = Zz, sin bor) 
<3 M,(Ty3 Z sin bss) 


The first three terms containing sines on the right-hand side 
of eqn. (44) can be directly integrated, but not the last three 
terms. 

One method of approximation is to assume that ¢,, ¢;3 and 
¢3 are zero in the last three terms only. The first of this group 
of terms then becomes 


a = ui V, 


Dok j 
in 0,50 
MO 12712 


{sin 8120613 653)] = . (46) 


and similarly for the other two terms, and all can then be 
integrated. 

Greater simplicity will be obtained at some sacrifice in accuracy 
if the small angles 417, 413, etc., are assumed to be zero in all 
terms of eqn. (44). If at the same time the values of T),12, Tints 
and T,,,23 are modified so that eqn. (20) is satisfied, the integral 
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will then have the correct value in the region of greatest interest, 1 ‘9 > » 
close to the singular (saddle) point. 2M, + M, = Mae 1M67, + M)M363; + M,M,673) 
Then 
Vivo. 
View V,V3.. Se, [sin O15. (912 — 442-) + cos 642 — cos Pir] 
Tni2 el sin. O45, + —— sin O36 12 
Z12 Z\3 VV. (48) 
2'3r.: me 
Poe + sin 053,(8.3 — 853,) + cos 05, — cos 0 
tere ee a 2 Sein Bia) (47) Ze [ 23c\023 23. 23 23] 
2 23 
Vi V3 . 
Viva Vane ++ [sin 0)..(0,3 — Oyen) COS.0, 4 Case, 
Fae A ein Oe ot - Sathy. Za [sin 413-913 — 136 13 13¢] 
a od This gives a simple equation to use in practical problems such 


and eqn. (25) reduces to as the example given in Section 3.5. 
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MULTI-GAIN REPRESENTATION FOR A SINGLE-VALUED NON-LINEARITY WITH 
SEVERAL INPUTS, AND THE EVALUATION OF THEIR EQUIVALENT GAINS 
BY A CURSOR METHOD 


By M. J. SOMERVILLE, B.Sc., Graduate, and D. P. ATHERTON, B.Eng., Student. 


(The paper was first received 26th February, and in revised form 15th May, 1958. It was published as an INSTITUTION MONOGRAPH in 
July, 1958.) 


SUMMARY 


A general method is described for calculating the equivalent gain 
‘through a non-linear element for a signal of known amplitude 
‘probability distribution, in the presence of other, uncorrelated, signals 
sor disturbances, also of known amplitude-probability distributions. 
‘A cursor method for calculating the equivalent gains is developed, and 
‘cursor results are verified by theoretical and experimental points, for 
‘the equivalent gains of Gaussian and sinusoidal signals applied together 
(to certain non-linearities. An example is given, showing how the 
behaviour of a non-linear feedback system may be analysed when two 
‘signals of different amplitude probabilities are applied to it. 

It is shown also that the equivalent gain to a Gaussian noise signal 
rin the presence of other signals applied to a single-valued non-linearity 
has the same value as the incremental d.c. gain through the non- 
liiearity (with the input signals present). Experimental results are 
\given using this as a method of obtaining the value of equivalent gain. 


(1) INTRODUCTION 


When several inputs are applied together to a non-linear 
‘element, the output consists of components due to each separate 
input, plus harmonics and cross-modulation products of the 
(inputs. It is desirable to replace such a non-linearity by an 
‘equivalent linear gain (defined as being that which minimizes 
mean-square error!), since calculations on its effect in a feedback 
system are thereby facilitated. If these inputs to the non-linear 
element are of different amplitude probabilities or different 
magnitudes of similar waveforms, the equivalent linear gains (of 
lall the signals) do not have the same values, and each separate 
gain is a function, not only of the magnitude of the particular 
input, but also of the magnitudes of each other input. An 
approximate method? which has been used to simplify calculation 
of the equivalent gain of a non-linear element with two input 
signals A and B assumes that the gain for each signal has the 
same value k 4, (Fig. 1); the equivalent gain is then effectively 


SIGNAL A 


SIGNAL B 


Fig. 1.—Simplified single equivalent gain for two signals to a 
non-linear element. 


ti:at which gives minimum mean-square error between the out- 
pet of the non-linearity and that of the equivalent linear gain. 
This method is found to give reasonable accuracy for a combina- 
tien of a sine wave with Gaussian noise applied to a saturation 
son-linearity, but large discrepancies can result if this simplifying 
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approximation is made for certain other non-linear elements. In 
the precise method of calculation the equivalent gain for each 
input signal is effectively that which gives minimum mean-square 
error as the equivalent gain for that particular signal is varied 
(Fig. 2). Separate minimizations of error are carried out to 


SIGNAL A 
SIGNAL B 
SIGNAL C 


Fig. 2.—Miulti-gain representation of a non-linear element. 
k4, kp and kg are adjusted separately to give a minimum mean-square value for xq. 


find the equivalent gains for each of the other signals. The 
representation of a non-linearity by a particular equivalent 
gain for each of the input signals will thus result in a smaller 
mean-square error than if the simplifying approximation, that 
all equivalent gains have the same value, is taken. 

The equivalent gain for a very small sinusoidal signal applied 
with other input signals to a single-valued non-linearity will have 
the same value as the incremental d.c. gain (in the presence of the 
same input signals) through the non-linearity. If one of the 
input signals is a Gaussian noise signal x;, for which p(x;) 
= [1/\/(Qzm)cle—*7/?”, it may be considered as being composed 
of a large number of small sinusoidal signals c, cos (w,t + ¢,) 
(see Reference 3), where ¢,, is randomly distributed, with a uni- 
form probability distribution from 0 to 27, and c,, are constants 
describing the frequency spectrum of the noise signal. Thus 
the gain for each of the small sinusoidal components of the noise 
will equal the incremental d.c. gain which occurs when all the 
input signals (including the Gaussian signal itself) are present. 
The equivalent gain for the whole Gaussian signal will thus have 
the same value as that of each of its components, and the gain 
for any Gaussian input signal in the presence of other signals 
applied to a non-linearity, has therefore the same value as the 
incremental d.c. gain with all these inputs present. A proof of 
this property is given in Section 4. 


(2) THEORETICAL ANALYSIS OF EQUIVALENT 
GAINS 
A single-valued non-linearity is assumed to have the charac- 
teristic v =f(v), Vo and v; being the output and input 
signals respectively. An input v; = (6 + x + y) is applied to 
the non-linearity (Fig. 3). [ is the d.c. level of the input; 
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0.c. LEVEL 
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LINEAR 
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Vout (V;) 


Fig. 3.—Equivalent gains for signals x and y. 


a.c. inputs x and y are taken to have zero average value and are 
assumed to be uncorrelated. In the general case, x is the signal 
for which the equivalent linear gain is to be determined, and y 
is the sum of all other signals y; ... »,. The amplitude proba- 
bility distributions of x and y are respectively written g(x) and 
p(y). For simplicity, y will first be assumed to represent only 
one signal. The general case where y = y; +... y, is treated 
in Section 10.1. 

For the input (8 + x + y) to the non-linearity, the output 
is % =f(B +x +), while that from the equivalent linear 
gains has the value (Kx + ky). The difference x, between 
the outputs from the non-linearity and equivalent linear gains 
represents cross-modulation products and harmonics of the input 
signals, and also includes the d.c. level A at the output from the 
non-linearity. 

The instantaneous value of this difference or error is 


Ny (Pp tty) (KX ky) (1) 
and its mean-square value is 
it 
1 
M = | xpdt . (2) 
lim T+ ~0 


xX, is a function of x and y, both of which are functions of time 
and are partly defined by their amplitude probability distributions 
q(x) and p(y). The probability that x lies between x and (x + dx) 
while y also lies between y and (y + dy) is g(x)dx. p(y)dy, 
provided that x and y are uncorrelated. This joint probability 
is in fact the proportion of time spent in this condition, and the 
corresponding contribution to the integral of eqn. (2) is therefore 
x3,q(x)dx . p(y)dy. For all values of y, with x between x and 
(x + dx), the total of such contributions is 


y=+o 
i x74(x)p(y)dxdy 
y=-—0 
The total contribution from the sum of these components for 
all values of x is thus the mean-square error M, and has the 


value 
Via e9) 


M= | <davopodxay 
y=-0 


(3) 


Putting the value of x;, from eqn. (1) into eqn. (3) gives 
Ito 
LPB +849 -2f8 +x +x + hy) 
+ (Kx + ky)"]a(x)p(y)dxdy (4) 


As the value of K is adjusted, a minimum mean-square error is 
obtained when 0M/dK = 0; 


x 


DAee. (a eee 
=| [ [-27@+x4 


wT Kx? + 2kxy]q(x)p(y)axdy = 0 
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——————— Ss 


Now 
x=+o ~.y=+0 x=+0 y=t+o 
| 2kxyq(x)p(y)dxdy = 2k xq(x)dx | yp(y)dy 
x= — 0 “yo - x=—-o0 yru-o 


+ 00 
=0 it | Seay: Se 


+0 
or if | yp(y)dy = 0 


Thus the above part of the expression for )M/dK is zero if the 
average value of x or of y is zero. This assumption was in 
fact made at the outset to make this term zero. The condition 
that the input signals x and y have zero average level does not 
restrict the analysis, as may at first appear to be the case, but 
means simply that the gain to a signal which is made up of a d.c. 
component plus an a.c. component (of zero average level) must. 
be represented by two different gains, one for the d.c. and the 
other for the a.c. component. The latter is the equivalent gain 
K (or k) and the former is represented by the incremental gain 
for d.c., and has the same value for all d.c. components, which 
are lumped together for convenience to give the d.c. level f at 


———. 


the input. 

Thus, . 
2M x=+0 ~.y=+0 . 
<= Owhen| [-2f8 + x + y)x 

ih) DES Ble + 2Kx?]q(x)p(y)dxdy = 0 ] 

Whence { 
x=+0 y=+o0 ; 

J I xf(B + x + y)q(x)p(y)dxdy 

bee he y=+o | 

J | eacopoaxay . 

x=—0 ~y=—o : 

y= +00 ; 
But | p(y)dy = 1, therefore j 
y=— 00 3 

1 x=+0 ,.y=+0 { 
Kaa) | B+ e+ yaCpoyady © © 


oh 


where # is the mean-square value of the input x. Similarly, 
differentiating M [eqn. (4)] with respect to k, and putting 
0M/dk = O gives 


ee ee | 


1 Y=H=+O .xx=+0 
k= sa) J fB+x+ no meade . 
where j* is the mean-square value of the input y. | 


+00 
As y tends to zero and J P(y)dy = 1, eqn. (5) gives the 


usual expression for the equivalent gain K of a signal x in the 
presence of a d.c. input f, 


x=-+ 0 
i 


| SB + xaedds OF 


(3) TWO-STAGE EVALUATION OF K 


For a d.c. input y applied to the non-linearity together with 
the inputs (8 + y), the d.c. output from the non-linearity has 
a value : 

y= +0 


ay) = | fy +B + ypvdy 7 


(8) 


Suppose now that the function g(y) given by eqn. (8) is the 
characteristic of a single-valued non-linearity, and that the 
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‘nput x is applied to it. The equivalent gain K,. of the signal x 
-hrough this non-linearity with characteristic g(y) has then the 
walue [cf. eqn. (7)] 


x=+0 


1 
Kee =| XE (x)qQ)dx resin aeebres. (9) 
Putting in the value of g(x) obtained from eqn. (8) gives 
x=+0 


1 
Ee= | 2 


y= + 


LO + B + Yp()dvatddx 


1 x=+0 


Vig ree 
= sa) [a8 +x + dacopiyaxdy 


Comparison with eqn. (5) shows that K, = K, i.e. the equivalent 
gain through the non-linearity for a signal x in the presence of 
signals (6 + y) is equal to the gain of x applied alone to the 
Peffective non-linearity’ with the d.c. input/output characteristic 
2(y) of eqn. (8). 

This result, proved above for y a single signal, is shown in 

Section 10.2 to be true also for the general case when 
ry, +. 
The process ice evaluating the equivalent gain K may now be 
dene by a relatively simple two-stage method, in which the 
bharacteristic g(y) is first calculated, and the cain of the signal 
x through this effective non-linearity is then obtained. This 
method proves particularly amenable to the use of cursors* 
“Section 5), one to obtain the characteristic g(y) and another 
to evaluate the equivalent gain of x through the non-linear 
tharacteristic g(y). In the general case (Section 10.3)  d.c. 
cursors are used to construct the characteristic ie one cursor 
being required for each of the inputs y;...y,... 

The equivalent gain k, for the signal y, may be ere in 
che same way as that described above, and is the gain of the 
signal y, through the ‘effective non-linearity’ represented by the 
d.c. input/output curve when inputs (B +x+y-—y,) are 
applied to the non-linear element. 


(4) INCREMENTAL D.C. GAIN 
It follows from Section 3 that the gain for a very small] sinu- 
s0idal signal will have the same value as the slope at the origin 
of the d.c. input/output curve for the non-linear element with all 
the inputs applied. This slope is the incremental d.c. gain 
-hrough the non-linearity, and its value is of some impor- 
‘ance, since incremental (Nyquist) stability of a feedback loop 
containing the non-linear element will be determined by the 
magnitude and sign of the gain to small sinusoidal signals. 
For an input (8 + x+y) to the non-linearity, the d.c. 
yutput A has the value [cf. derivation of eqn. (3)] 


yH=+o 


LIE + x + Dacre ards 


x=+0 


ele 


—. = incremental d.c. gain 


; 


-[ ae 4, ie +x + y)q(x)p\y)dxdy 


—0o “y=—o 


WOW, 


d 
wares eared. 


Sf@tx+y= SfB+xty) 
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Thus 
dX y=H=+o x=-+ 00 
dB = PY) {f aeoats (8 +x + me 


= f oy {[16 +x + yaa]** 


y=-0 
x=+0 


ae fE +x+ ypalavo) ly 
Assuming g(x) = 0 atx = +0 


A Ment 
es PY  { 


When the signal x has a Gaussian amplitude probability distribu- 


x=+ 0 


| f(B +x + y)d[q colhay eg a(U) 


x=-—0 


, 1 
tion, q(x) = ae where o is the r.m.s. value % of the 
signal x, V(2m)o 
1 —2x 
a x2/262 See 
dla] = Fams( a2 yer dx = — Sa(x)dx 

Putting this value for d[q(x)] into eqn. (10) gives 

dN {dg (oe ae 

—=s=s xf(B + x + y)q(x)p(y)dxd: 11 

dB #1, 278 Pp p(t) 


Comparison of eqn. (11) with eqn. (5) shows that, for a Gaussian 
signal x, its equivalent gain in the presence of signals (8 + y) 
is equal to the incremental d.c. gain through the non-linearity 
in the presence of the inputs (8 + x + y), ie. 


K = ddB (12) 


Section 10.4 gives a proof of eqn. (12) for the general case when 

=); +...y,. Thus, if several Gaussian signals are included 
among the inputs x, y;,... y,, then each of the Gaussian signals 
has the same value of equivalent gain, equal to the incremental 
d.c. gain. This fact enables all Gaussian signals at the input to 
a non-linear element to be conveniently lumped together as a 
single Gaussian signal of mean-square value equal to the sum 
of the mean-square values of each component. This technique 
is used in the example given in Section 7. 

Eqn. (12) provides a useful practical method for measuring 
the equivalent gain for a Gaussian signal. Direct measurement 
is obtained, for example, by adjusting the gain k, (input A in 
Fig. 2 being the Gaussian signal) to give a minimum mean-square 
error. The accuracy obtained by this method is usually low 
(about +20°%) since the null of the mean-square error can be 
obscured by the presence of cross-modulation products and 
harmonics. Also, since the mean-square null instrument must 
have a long time-constant to smooth out random low-frequency 
fluctuations,° it can take a considerable time to obtain the null 
balance. Measurement of incremental d.c. gain, on the other 
hand, does not involve a null adjustment, and can give good 
accuracy. To obtain an accurate measurement of incremental 
d.c. gain by this method, only a small d.c. increment should 
be applied to the non-linearity, the corresponding d.c. incre- 
ment at the output should be measured on an instrument with 
a long time-constant to smooth out random low-frequency 
fluctuations, and several points for positive and negative incre- 
ments should be taken to allow, by interpolation, for curvature 
of the d.c. input/output characteristic. 


(5) CURSORS FOR CALCULATING EQUIVALENT GAINS 


Two cursors are required to evaluate equivalent gains by the 
method outlined in Section 3; a ‘d.c. cursor’ to give the effective 
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non-linear characteristic g(y) in the presence of signals (B+ y), 
and an ‘a.c. cursor’? to give the equivalent gain of the signal x 
through the effective non-linear characteristic g(y). 

In the general case, y= jy +.--¥p T+ + +n (Section 10.3), 
the d.c. cursor for y, is applied to the equivalent non-linear 
characteristic g,_(y,_1) to obtain the equivalent non-linear 
characteristic g,(y,). This procedure is repeated n times for 
r=1 to r=n, starting with go(yo) =f(P + yo), the actual 
non-linearity, until g(y) = g,(y,) is obtained. 

The a.c. cursor for x is applied to the effective non-linear 
characteristic g(y) to give the equivalent gain K to the signal x. 
When the equivalent gains k, . . . kK, for all the signals y,. . . Y, 
are required as well as that for x, the procedure will be similar 
to that described above for x, and in this case both a d.c. and an 
a.c. cursor will be required for each of the signals applied to the 
non-linearity. 


(5.1) The D.C. Cursor 
In the general case (Section 10.3), 


yr=+o 
&AYr) =| ENO FT LP UAOHIS Fw tus (OE), 
B= CS 
If y is a single signal, eqn. (13) reduces to 
y=+o 
ey) =| fy +B + yey 
y=— 00 


The general eqn. (13) may be rewritten 
1 
EY) = [ &r—1) Yr tie yl | x04, ° (14) 


i 
The value of i P,(y,)dy, increases continuously from zero with 
===) OO) 


Y= — 0, to +1 with y= +00. If N values of Y (1%... 
Yr... Yn) are obtained, such that 


i; 1 Y(r+1) 1 aoe 1 
Oo, =a.-.| v0), ——...| eer 
E Ona, = ay ee. Od, = x y Pray, aN 

the values Yp then represent the most probable values (i.e. mean 
position) of y, over the range 


1 “he R 
we a line PV) Me S NW 

Thus the integral of eqn. (14) may now be evaluated approxi- 

mately by the summation 


{| ey 
&r(Yr) = N p> 8-1, i Yr) O . (15) 


Taking N= 20 usually gives a reasonably accurate result. 
The values of Yr for sinusoidal and Gaussian signals are 
calculated in Section 10.5 and tabulated in Table 1 for the case 
when N = 20. 

Fig. 4 gives a diagrammatic representation of the values of 
Yr, for the cases when E = 1-0h and 2'?¢ = 1-0h (ie. equal 
powers of sinusoidal and noise signals, normalized relative to 
E=h). If Fig. 4(@) or (6) is drawn on tracing paper, it can be 
used as a d.c. cursor by placing it over a graph of the non- 
linear characteristic (or effective non-linear characteristic), when 
the intersections of the vertical lines y= Y,;...Yp 1 50% 
with the non-linear characteristic g,,_)(y,_1) give the wales 
&r—1(7, + Yr). Thus, to obtain the d.c. output g,(y,) for an 
input y,, the d.c. cursor for y, is placed over the characteristic 
8&,—1(Y,—1), With its origin at the point y,_, = y,, and the values 


Table 1 
VALUES OF Yr ON D.C. Cursor FOR (a) SINUSOIDAL AND (b) GAUSSIAN SIGNALS 
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(6) Gaussian: Yp[2120 


(a) Sine: Yr/E 
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(b) 
Fig. 4.—D.C. cursors. 


(a) For sinusoidal signal. i 
(6) For Gaussian noise signal [E = 4/(2)o = 1-0A]. 


&-1(7, + Yr) are summated [eqn. (15)]. This process is 
repeated for a range of values of y, to obtain points for the 
| curve g,(y,). 

The d.c. cursors of Fig. 4 are of use only for the one signal 
magnitude and must be redrawn for each required value. 
Fig. 5 shows the d.c. cursor for a sinusoidal signal drawn (cf. 
Reference 4) in such a way as to enable it to be used over a range 
of normalized values of E between 0 and 2h. In using this cursor, 


2h 


y 
ff} 
ij 


2°Oh 


i 
I 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
] 
| 
| 
| 

-2:Oh | 


E 
Fig. 5.—D.C. cursor for sinusoidal signals of magnitudes 0 to 2:0h. 
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the point Yp on the particular horizontal line corresponding to 
the required value of E is made to coincide with the curve 
&(r—1L¥~—1)] by moving the cursor vertically. The ordinate 
of this point of coincidence gives only one of the N values 
&(r—1)(¥r + Yr), and the cursor has to be manipulated in the 
above manner for each of the N values to be taken. Since the 
cursors of Fig. 4 prove somewhat easier and quicker in use than 
that of Fig. 5, the former would be used in preference to the 
latter, as is convenient, for example, when only the gain for the 
signal x is required, in the presence of constant amounts of 
signals y = yj +... ), 


(5.2) The A.C. Cursor 


From eqn. (9) (and Section 10.2) the equivalent gain for a 
signal x is 
eee. 
K= | g@xqndx 
xX“ ~— «© 
Since X is the r.m.s. value of the input signal, the r.m.s. output 
$(x) from the equivalent gain has the value 


+00 
DX) = Kx | s@)2aeddx 


which may be written 


=F 100 
$(x) = | g(x)d || “ats (16) 


Xx 


58 : P 
The value of | —q(x)dx increases in value as X moves from —©oo 
5% 
— 3} 


to zero, but starts to decrease for positive X, and becomes zero 
for X¥ = + ©, since x has been taken to have zero d.c. component. 

0 foe) 

Re x 3% 1 
Thus — | —qx)dx= | =qa)adke = = = — 

Les > 
where x,, is the average value of the signal x, and F is its form 
factor, r.m.s./average. 

An approximate evaluation of the above integral may now be 
obtained, as follows, in the same way as was used for derivation 
of eqn. (15) for the d.c. cursor. N values of X(X,...Xp...Xy) 
are thus calculated, such that, for negative X, 


X X(R+1) 


5s —1 1 xy 
ate SS Se a5 56 || SECO! 
| fee 2F 2N | £eco ¥ 
ae SG 0) 
Zip al x iwi 
ope | Sacnas > OF oy 
X(y/2) 
and for positive X, 
X(w/2+1) X(R+1) 
G it il x 
a = a e552 || SAGE! 
| Seco IF 2 | £200 x 
0 XR os 
(art x 1a 
— oF N _| Sects = se 
Xy 


Then the integral of eqn. (16) may be evaluated approximately 
by the summation 


; Se eS 4 Ee ae ) (17) 
d(x) = 2F| PF aoe Bor iiok : 
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(N is an even number.) The values Xp for a sinusoidal and for 
a Gaussian signal are calculated in Reference 4 for the case 
when N = 20, and are reproduced in Table 2. 

The values of Xp shown in Table 2 are now used to construct 
an a.c. cursor’ which is similar to the d.c. cursors (Figs. 4 and 5). 
The a.c. cursor is used in much the same way as was the d.c. 
cursor, to obtain the ordinates g(X) which are summated and 
divided by 2F to give the value of ¢(x) for the particular input- 
signal magnitude, x. | 

When an input signal contains d.c. as well as a.c., it is split — 
into the separate components Xo (d.c. level) and x (a.c. with zero 
d.c. level), since the d.c. gain is specified by the incremental d.c. 
gain, and the gain for x is its equivalent gain K. (It is necessary 
to split up the signal in this way, since the gain K and the incre- 
mental d.c. gain dA/d8 do not in general have the same value.) 
The equivalent non-linear characteristic g(y) for the signals y 
together with all d.c. components other than xg is then con- — 
structed so that, in determining the equivalent gain K for various — 
magnitudes of the a.c. component x, the cursor must each time 
be used on the modified characteristic g(y + Xo). 


eed SS 


1 
O2d25e ee. 
Or 31 2.a. 
+0-231 


10 
0-475 
—0-312 
—0-231 


—0-403 


—0-536 


(5.3) Evaluation of the Joint Probability p(y) for Signals 
Vy ata ears Vy 
The equivalent non-linear characteristic g(y) may be obtained 
by a method alternative to that described in Section 5.1, by 
determining the overall amplitude probability distribution p(y) 
(y= y; +...y,), when, from eqn. (8), 


—0-658 


+ 00 
e=|fo+B+yp0dy . . . C8) 


—0°773 


In certain cases, this method can prove simpler and quicker than 
that of Section 5.1. For example, if the gain for the signal x is 
required for a range of values of y (the relative values of y; ...¥, 
remaining constant), then determination of the characteristic g(y) 
for each new value of y involves only a single-stage cursor 
operation to evaluate eqn. (18), whereas use of the method of 
Section 5.1 would require n stages of cursor operation. 


Table 2 


= O2093 


q The total amplitude probability distribution #,(z,) for a signal 
5 Zy = ¥, + y2 is obtained from the convolution integral 
V2 sie 
bi) =| rile —ydrddr . . . (9) 
y2=— 0 


For a signal z3 = yj + yp + 3 = 2 + V3, 


VALUES OF Xp FOR (a) SINUSOIDAL AND (b) GAUSSIAN SIGNALS 


SINS /7/ 


3 aac 
t3(z3) = J t(Z3 — ¥3)p3(¥3)dy3 
Y3=— 0 


—1-379 


Putting in the value of t,(z; — v3) from eqn. (19), 


Y3= +0 py2=+0 
t3(z3) = i . k P3(¥3)P(¥2) P1123 — (2 + y3)|dy3dyz 
ee OL) 


y3=— 0 


—1-733 


To obtain the probability distribution p(y) when y = y, + 
..- JY,» convolution integrals 


& r= +00 
oles t,(z,) = J tr—1)(Z, 7% VPP AYy)dy, . . . (20) 

Qo Spe assy 
oF a are carried out, from r = 2 to r = a, to give 
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Eqn. (20) can be written 


t,(¢,) = i te -n(@ — »A] [rrdy, 


Comparing eqn. (22) with eqn. (14), it is seen that an approximate 
evaluation for ¢,(—z,) may be obtained by using the d.c. cursor 
for the signal y, on the characteristic t,_ ,(—z,_ ), in just the same 
manner as was described in Section 5.1, where the characteristic 
-g(y,) was constructed from &—1(¥,-1)- Thus the total ampli- 
tude probability distribution p(y) is obtained by applying the d.c. 
cursor in a total of (n — 1) stages. 
Now, rewriting eqn. (18) in the form 


gly) = [te +B+yd | [ocr 


the effective characteristic g(y) may be constructed from the 
actual non-linear characteristic using the d.c. cursor for the 
signal y. The values Yp for this cursor may be obtained from 
graphical integration of the curve p(y). 


(22) 


(6) THEORETICAL AND EXPERIMENTAL EQUIVALENT 
GAINS 
Theoretical results for the ‘dead-space’ non-linear charac- 
teristic (Fig. 6) have been constructed by the cursor method, 


Fig. 6.—‘Dead-space’ non-linear characteristic. 


/ and with check points obtained, when possible, by calculation. 
(Even for only two inputs to the non-linearity, the purely 
| theoretical evaluation of their equivalent gains can be extremely 
complex, and the integrations would often have to be evaluated 
by numerical methods.) For convenience, the signals x and y 
_ are written in normalized form, relative to the fixed reference 
level h. A signal of r.m.s. value ¥ is taken to have a normalized 
value 2!/2X/h: in this way a sinusoidal signal of peak value h has 
a normalized value of unity. 

The curves of Fig. 7 are for an input to the dead-space non- 
linear characteristic consisting of a sinusoidal signal plus 
Gaussian noise, of normalized values P,, and o,, respectively. 
For P,, ~ 1:0 and low values of o,, (e.g. P, = 1:0, o, = 0°5), 
the difference in equivalent gain K to signal P,, and the equivalent 
gain k to noise a, is appreciable. These results for the dead- 
space characteristic may be used to construct curves for the same 
‘aputs to other non-linear characteristics. A ‘limit’ non-linear 
characteristic represented by Fig. 8 may be written [1 — D.S.(A)], 
where D.S.(9) denotes a dead-space characteristic with ‘break’ 
soints at +0, and with unity slope for inputs of magnitude 
greater than 6. The ‘limited field of view’* (1.f.v.) non-linear 


* An example of this type of non-linear characteristic is that of certain radar 
“irection-sensing aerials, where the error signal from the aerial becomes zero for 
irge positive or negative angular displacements. 
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Fig. 7.—Equivalent gains for a sinusoidal signal plus Gaussian noise 
applied to a ‘dead-space’ non-linearity. 
(a) Gain K for a sinusoidal signal P». 
(b) Gain k for Gaussian noise op. 
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Fig. 8.—Construction of a limit characteristic from a ‘dead-space’ 
characteristic. 


characteristic shown in Fig. 9 may be written [1 — 2D.S.(A) 
+ D.S.(2h)]. Equivalent gains for sine wave plus noise signal 
to these non-linearities are shown in Figs. 10 and 11, together 
with experimental points. 

Theoretical results for three signals applied to the dead- 
space non-linearity of Fig. 6 are shown in Fig. 12. The three 
signals taken are two sine waves (P,, and S,,) of unrelated fre- 
quencies, and Gaussian noise (a,). Equivalent gains for these 
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Fig. 9.—Construction of a ‘limited-field-of-view’ (l.f.v.) characteristic 
from ‘dead-space’ characteristics. 
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Fig. 10.—Equivalent gains through the limit non-linearity for a 
sinusoidal signal plus Gaussian noise, 


(a) Gain K for a sinsuoidal signal P,. 
(6) Gain k for Gaussian noise cp. 
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Fig. 11.—Equivalent gains through an L.f.v. non-linearity for a 
sinusoidal signal plus Gaussian noise. 


(a) Gain K for a sinusoidal signal Pp. 
(b) Gain k for Gaussian noise op. 


three signals through an 1.f.v. non-linearity have been obtained 
from those for dead-space (Fig. 12), and are shown in Fig. 13 
together with experimental points. 


(7) DETERMINATION OF OPERATING POINTS ON THE 
NON-LINEAR CHARACTERISTIC 

When the non-linear element is included in a feedback loop, 
the operating point for each signal is a function of all the other 
signals present at the input to the non-linearity. Thus, if 
the non-linearity is represented by WN different equivalent 
gains, the operating point is obtained, in effect, by the solution 
of N simultaneous equations. If a mathematical solution is to 
be attempted, it is first necessary to represent the gains K and 
k, . . .k, by equations involving the magnitudes of all the 
signals x, y;...¥,. For example, with sine waves and Gaussian 
signals applied to the non-linear characteristic of Fig. 143, an 
approximate expression which holds over a limited range of 
values for P,, and o, can be obtained from the curves of Fig. 16: 
for small deviations of P,, and o,, about a normalized value 1-0, 
K ~ 3-2 +1-5(P, — 1:0) + 2-8(¢, 1:0) and k ~ 3-96 
+ 1:96(c, — 1:0) + 2:8(P,, — 1:0). When only two different 
equivalent gains are needed to specify the non-linear element, a 


relatively simple graphical construction can be used to find the 


operating points. To illustrate this construction, the second- 
order feedback system of Fig. 144 containing a hard-spring 
non-linear characteristic (Fig. 148) will be analysed. The three 
inputs to the system are a sine wave of frequency 1:5wp. and 
two Gaussian noise signals v,, and v,, the frequency spectrum 
of v,,; having 6dB per octave low-frequency attenuation from 
wo/8 and 6 dB per octave high-frequency attenuation from wo/4, 
and v,. having a similar spectrum with low-frequency attenuation 
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Fig. 13.—Equivalent gains through an I.f.v. non-linearity for two 
sinusoidal signals plus Gaussian noise. 


(a) Gain K for a sinusoidal signal Py. 
(b) Gain k, for a sinusoidal signal Sp. 
(c) Gain k2 for Gaussian noise op. 


beginning at wy and high-frequency attenuation beginning at 2wo. 
These noise signals are present in the ratio v2, :v2,=4: 1, 
and, having the same value of equivalent gain, can be considered 
as a composite noise signal of total noise power vz = v2, + v2. 

For a signal v; applied to the system, the signal vp at the input 
to the non-linear element has the value 


°o 05 10 15 20 
P. 

(¢) (p/w)* 

Vp = % 5 ra 
. : 5 [(p]/~) oT. Keq] Se K (p/p) | 
Fig. 12.—Equivalent gains through a ‘dead-space’ non-linearity for 

BO, SUT SEO ess 2a USSiai Oise. where K,q is the value of equivalent gain through the non- 
(a) Gain K for a sinusoidal signal Pn. linear element, and « is the damping factor Tw 9, which is taken 


(b) Gain k, for a sinusoidal signal Sp. 


(c) Gain k2 for Gaussian noise op. to have a value 0:2. 
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Fig. 144.—Non-linear feedback system. 
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Fig. 14B.—‘Hard-spring’ characteristic. 


For a sinusoidal input signal V,, of frequency 1-5w, the signal 
P,, at the input to the non-linear element has a magnitude 


Dies 
Pn Vg ‘09K* — 4-5K + 5:06)!/2 
where K is the equivalent gain to a sine wave. 

The ratio P,/V,, is plotted in Fig. 15 [curve (a)] for a range of 
values of K from | to 5. 

An expression similar to that of eqn. (24) can be obtained 
for each of the noise signals by integration over the power 
spectrum of the noise signal. The power spectrum ®(w) of the 
input noise signals is given by 


(24) 


7) 2 


(p + w)(p + wy) 


where w, and w, are respectively the frequencies at which low- 
and high-frequency attenuations of 6dB per octave begin, and 
v? is the total power of the input noise signal. 

From eqn. (23), the power gain G(w) from the noise input to 
the input of the non-linear element is, for « = 0:2, 


2, 


O(w) = 2(w, + wo) A 


Vv 


D 2 


p? + 0:2kwop + kw 


in which p = jw and k is the equivalent gain for a noise signal. 


G(w) = 


—— 


E-VALUED NON-LINEARITY 
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Fig. 15.—Magnitudes of signal and noise to the non-linear element. : 


(a) Variation of the sinusoidal signal applied to the non-linear element, with the 


equivalent gain K. 


(b) Variation of the noise signal applied to the non-linear element, with the equivalent 


gain k. 


_ Signal amplitude to non-linear element 
Input signal amplitude 
__ R.M.S. noise to non-linear element 


R.M.S. noise input 


Fig. 16.—Detailed equivalent-gain curves through the ‘hard-spring' 
characteristic of Fig. 15 for a sinusoidal signal plus Gaussian 


noise. 
(a) Gain K for a sinusoidal signal Pp. 
(b) Gain k for a Gaussian noise op. 


The power spectrum at the input to the non-linear element is 
mow O(w)G(w), and its total noise power oc? has the value 


o = | O(w)G(w)deo (25) 
0 


‘This integral is evaluated in Section 10.6 to give the results 


Eon (0-2 + 413-3k) 

mm 10-2 + 16°48k + 220-4k2) 
eases (1 + 6-30k) 

n2 


"(1 + 1-55k + 0-42k2) 


_ The total input noise power v2 is made up from v2, and v2, 
in the ratio 4 : 1, so that v2, = #2 and v2, = 4v2._ Thus 


932 end 
On = Ont st O23 = v2 


E 0:255 413-3) 16 Weel 
5 (0-2 + 16-48k + 220-4k?) 


(1 + 6°30k) 
5 (1 +1-55k + 0-42k?) 


[The ratio o,/v, obtained from the above expression is plotted in 
‘Pig. 15 [curve (6)] for values of k from 1 to 5. 

Using the equivalent gain curves of Fig. 16(a), the curve (a) 
oi Fig. 15 showing variation of P,, with K can be converted to a 
peurve of P,, against o,, the noise required at the input to the 
non-linear element, together with P,,, to give the gain K to P,. 
Seared the equivalent gain curves of Fig. 16(b) can be used to 
‘convert curve (b) of Fig. 15 to a curve of o,, against P;. The 
ycurves P,, o, and o,, P, are plotted together in Fig. 17 for 


10 
x 
s vy, =10 
ee 
0:5 
¥, =0:5 
Vv, =0°25 
all 
ve 
oS OH +0, 15 20 
Pye Pa 


Fig. 17.— Determination of operating points on the non-linear 
characteristic. 


several values of P,, and o,. Intersections of these curves now 
give the operating points for the particular values of V,, and v,,. 
Experimentally determined operating points are also shown in 
Fig. 17. For V,, = 0-5 and v, somewhat less than 0°25, it will 
be seen from Fig. 17 that three intersections can occur: this 
indicates the existence of a jump in operating conditions.® 

For large numbers of inputs to a system such as that con- 
sidered above, the problem is best simplified, if possible, so that 
orly two different equivalent gains need be considered, when the 
analysis can be effected as shown in the example. To help 
achieve this simplification, all Gaussian signals may be con- 
silered together, very low-frequency inputs may sometimes be 
ignored if the signal they produce at the input to the non-linear 
element is small, and high-frequency inputs may be assumed to 
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feed direct to the non-linear element, since the signal fed back at 
high frequencies is small. 


(8) CONCLUSIONS 


The multi-gain analysis of single-valued non-linearities is 
simplified by the introduction of the ‘effective non-linear charac- 
teristic’ which can be calculated for each signal. Further 
simplification can be obtained in that the equivalent gain curves 
for, say, a ‘dead-space’ characteristic can be used to obtain the 
curves for other more complex segmented characteristics. Also, 
the fact that the gain for all Gaussian signals in the input is the 
same enables these signals to be lumped together, as in the 
example dealt with in Section 7. Equality of gain for Gaussian 
signal with incremental d.c. gain is of use in experimental 
determination of the equivalent gain for noise, but it is also of 
significance, for example, when there is a reversal in sign of the 
equivalent gain for noise, as with the lI.f.v. non-linear charac- 
teristic [Figs. 11(6) and 13(6)], under which conditions d.c. 
instability of a feedback loop containing the non-linear element 
would take place. 

If each of a large number of inputs to a non-linearity is 
small compared with the total input signal, the absence of any 
one will not much influence the total probability distribution of 
the rest. The effective non-linear characteristic for each of the 
signals is therefore about the same and may be assumed nearly 
linear over a small central portion, so that the gains for each of 
these signals may be expected to be the same. Under these 
conditions it may sometimes be justifiable to consider a single 
equivalent gain for all the signals, its value being equal to the 
incremental d.c. gain through the non-linear element, in the 
presence of all the inputs. 

The spectrum of the distortion products x, which occurs with 
a noise signal plus several other inputs to a non-linearity is 
of some interest, the low-frequency spectrum® in x, usually 
being of particular importance since this appears as a permanent 
error signal, whereas the high-frequency part of x; may be 
sufficiently attenuated by the filtering action of the servo loop 
to be ignored. In the example dealt with in Section 7, it will 
be seen that, at the actual operating point, the noise signal is 
considerably greater than that predicted. This increase is found 
to be caused by cross-modulation products of the noise and sine 
wave, which in this example can have a considerable magnitude 
at the effective natural frequency of the feedback loop. 
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(10) APPENDICES 


(10.1) Equivalent Gain for a Signal x in the Presence of 
n Other Signals 


An input (8 + x + y) is applied to the non-linear element, 
where f is the total d.c. level at the input, x is a single a.c. signal, 
and y is the sum of all other signals y, ...y,. All input signals 
are assumed to be uncorrelated with one another. 

The output from the non-linear system is Yy = f(8 +x-+y), 
and that from the equivalent gains has the value Kx + (k,y; 


+...k,y,). The difference xy = {f(B +x+y)—[Kx+ (ky 
if 
+...k,y,)] I}, which has a mean-square value M = al xed, 
lim T—> © 0 


Extending the argument used to obtain eqn. (3) gives for the 
general case 


X= +0 y= +0 pyn=+o 
mM=| | vee I XHq(X)P (91) « - - Pal Ydxdy, .. . Vn 
x= OO y 4 © "Yn ee) 
Now 
x= {LSB +x +P —2fB +x +R + ky +.. kad] 
+ [Kx + ay +. +k ydPf 
and 
ry 
Sg OW = 10 — 2xf (B + x+y) + 2x[Kx + (ay +. ke] 
Thus 
IM X= +0 y= +0 pyy=+o 
aK J [—2xf(B +x +y) + 2Kx? 
Ne OO RY 9 CO, Vi) + 2x(kyy,+..-KpVn)] 
4(X)P (1) + Pal ndxdy... Vn 
Now 
X=+0 Y= +0 Yr=+0 
J fee J aetan +. kadar. 
4 % PAV acdy te aay AO 
++ 00 
since i xq(x)dx = 0 (i.e. x has zero d.c. level) 
so that = = 0 when 
x=+0 .yy=+0 
igaasig lentes ral Kegon) Paddy... By 
De oO ~yy 
X=+0 .Yy=+0 -yn=+0 
aa i J aa | xf(B +x ES as 
: 4 2 Oe . dy, 
+ co 


The a Pane side of this equation has the value K 2 qo)dx, 


since le 70,)d, = <7 1 


Hence 


+ 00 


1 X=+0 .yy=+0 ayy= 
| aie [PG += + paen00 
Nis OD V1 —'— 00) 
oan 


oo ay, 


(10.2) Proof that the Gain K for the Signal x Equals the Gain of x 
through the Effective Non-linear Characteristic a(y) 


The d.c. value g(y) of the output from the non-linear element, 
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when a d.c. input y is applied together with inputs By 
+...),, has the value 
T 


e) = a fy + B + ydat 


lim To. ~0O 


An expression may be obtained for g(y) in terms of the amplitude 
probability distributions of the signals y;... ¥,, by using the 
same argument as that by which eqn. (3) was ; derived. Whence 


n= +0 


yi= +o 
EQ) = = | fy +B+ynOn - 


2=— 0 


OD AVDA 2 ayn 


If the signal x were now applied alone to the equivalent non- 
linear characteristic g(y) its gain would have the value [eqn. (9)] 


1 x= -+ 00 
1a =) xg (x)q(x)dx 
x x=-—@ 
Putting in the above value of g(y), 
Lette pa tO p= + 
K,= 3 i} same [ ef Bde COE 
X=—O “Yy=—O “Yn=—O Oe lcd aE 


Comparison of this expression with that for the equivalent gain 
K, derived in Section 10.1, shows that K, = K. 


(10.3) Derivation of the Characteristic g(y) 
When a d.c. signal y, is applied to the non-linear element 


Si 


ee we 


a ee ae 


together with the input (8 + y,), the d.c. output g;(y,) has the . 


value 
Yl aeee 


fiyp = | for +B +ydnovdan 
y= — 0 


If a d.c. signal y2 is now applied together with the signal y 


to a non-linearity having a characteristic the same as the effec- — 


tive characteristic g;(y1) above, the d.c. output has the value 


Yr2= + 0 


30/2) = I 8102 + Ya)PaV2)4V2 


la eo) 


y2= +00 
= =| Mur f iY +B+y¥1 4+ y2)pi1ODP22)dydy2 


r= + 00 


Repeating the above process, g,(y,) = | &—1(yp + »)P,-(,)dy,, 
Irn OO 


and if r has values 1 to m (corresponding to signals y, ... y,) _ 
then 

n= +0 .yi=+0 
enln) = | ES [ font B+ +... yD - 

m=— 0" yy=— 0 

4 . PAYA, «9 ¢ dn 


This expression for g,(y,) is the same as that obtained in 
Section 10.2 [i.e. ¢,(y,) = e(y)] and g(y) can therefore be 
obtained by the above process of integrations, using the d.c. 
cursors described in Section 5.1. 


(10.4) Proof that, for a Gaussian Signal, K = wes 


dp 


An input (B +x+y) is applied to the non-linear element, 
where f is the total d.c. level, x is a single a.c. signal, and y is 


(General Case) 


all other a.c. signals y; ... y,,. 
ilinearity has the value 


The d.c. output from the non- 
T 


a f(B +x + y)at 


ome 0 


hich, from Section 10.2, may be expressed 


YiI=+O —yn= 
a=[ a I. 7G bet aco). 
yy=—e0 ey ey, : 


d 
1 ee cls Xate) 


x=-+ co 


ody, 


‘Now, 


erefore 
a + 00 


“TP acalse x4 »} 


Pi) - 


idA 
dp 0 ~Yn=— 
» + PaWn)dyy ... Dp 
‘integrating by parts, 


‘geal FB +x+y|=[ewsGt+x+y]"> 


o— — 


iat 


x= + 00 


— | £8 +x + Yalan] 
‘For x a Gaussian signal, 


e— 4/202 


/(2 ons 


2 


q(x) = 


= os 


and d[q(x)] = = q(x)dx, where x 


1 X= +0 Yy=+0 ~prn=+o0 
z = | ee! [ xf(B + x + Yar). 
ed | ESE ave Oe 


This expression for dA/d8 corresponds exactly with that for the 
equivalent gain, derived in Section 10.1, ic. for a Gaussian 
signal in the presence of other signals with which it is uncorre- 
lated, K = dd/dB. 


(10.5) Calculation of the Ordinate Yp for D.C. Cursors 


\(a) Sine “ y = Esin wt 

iP(y) = for |y/E| <1, and p(y) = 0 for |y/E|> 1 
TEN/ ‘- —(y/E)] ue a 

/Within the limits i 


1 ae A 
IW/E| < 1, | poray = sin MoH) 


‘From Section 5.1, 


= — — where = Lae 
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Thus 
att ee (2R — 1) 
1 sin (FZ 4) - sin 1.0] = aN 
whence 23 = sin E GR=W- HI 
E D N 


(6) Gaussian Signal 


1 
a ee? 
From Section 5.1, 
YR 
1 (2R — 1) 
—y?2/202 berets eS 
| Ona. oar 

whence 
(2R—1 ‘ 


Ja YR pe: — 
Nb 0 5| + erf (aid) | where ert = sale e—edx 
and erf (—z) = — erf z. 
Eh kh) 
ert (aie ) N 


(10.6) Solution of Equation (25) 


Thus 


From eqn. (25), 
oan 221 TE 9) 


7 
| 
This can be written in the form 


o2 Ns 221 as Wy) 
ah 
0 


in which dy) = kwZw wy 

dy = kwe + 0:2kwo(w, + w2) + ww, 

(wy +2), d=. 

From Reference 2, the solution of this integral is 
(—dod3 + did) 

— dys — d?dy + didnds) 


For v = v,1, @1 = Wo/8 and wz = wo/4, and evaluation of the 
above expression gives 


(ey 


(oe) 


3 2 


P 
d 
(p + op + op? + 0-2kewop + ka)| 


where p = jw. 


oe) 


p? 2 


dyp* + d3p> + dyp* + dp + do 


d; => kwi(w, a >) + 0:2kw ww, 
d= 0: ane 


24 


o* = (a, 4 


Wy) di 


ie (0-2 + 413-3) 
On = Yat (0-2 + 16-48k + 220-4k2) 


Similarly, for v = v,7, 1 = Wo and wz = 2wW , 


(1 + 6-30k) 
"2 4+ 1-55k + 0-42k?) 


ie 
Dip) — 


DISCUSSION ON 


‘NON-LINEAR DISTORTION IN TRANSISTOR AMPLIFIERS AT LOW SIGNAL LEVELS 
AND LOW FREQUENCIES’* | 


Miss L. Luz (communicated): Previous work done by the 
writer and by Dr. Meyer can be extended to medium frequencies. 
In the monograph the assumption is made that all elements of 
the transistor are frequency-independent real quantities. It 
does not seem necessary to make this assumption; the same 
expressions in fact can be used to calculate the 2nd harmonic 
distortion factor at any frequency if the complex parameters of 
the transistor are known at this given frequency. 

2nd Harmonic Distortionat 100 kc]s.—The expressions obtained 
for the common-base configuration are 


Ke i = Nb K 4 = (Re az ht )Kp je (A) 
cs it | DY . 
where 
ae. One, CL + hb,Rp)? 1 Ode, 1 + AB Re = 1 dhe, R 
a a ee Teint 829 
1 df, /1 +AZR,\2 1 048, 1+73R 1 0h 
(gee 2 ( 22 #) Pu 21 224tL DS) 
Tey Aca 2dve we, Re * 4 ape%e 
Di = — [Ri(hiyhbs — hpyhby) + Ro + 18,Rr) + hE] 


The expressions obtained for the common-emitter con- 
figuration are 


Pe ee Ke (Re Klee 
Ks = ie —_ Zine C D; 11 D if, (B) 
where 
1 0A§, (1 + AZRzr\2 1 0A5, 1 +1S5R 1 Oh§ 
Vee i Bet) — i 22 1 ON12 pn» 
SA OTe KS, RT Rie tae werk 
1 OMS; (1 + Ao2Rz\2 1 «OAS, 1 + ASR 1 Oh§ 
pee 21 22 *) Ye 21 224A + 072 n2 
TIE hs 210 ene Beit wert 


Dt = — [RiCrirhhn — Ajah§1) + Red + AS.Rp) + hi] 


Both expressions were checked experimentally using a junction 
transistor type 3X/302 having a common-emitter cut-off fre- 
quency of approximately 10kc/s. The calculations and measure- 
ments were made at 100kc/s, the parameters being directly 
measured at this frequency. 

The results are shown in Fig. A. Quite good agreement 
between calculated and measured 2nd harmonic distortion 
factors can be seen. Different vaiues of R, (internal impedance 
of signal generator) were considered. 

Calculation from an Equivalent Circuit,—If the parameters of 
an equivalent circuit are known then the hA-parameters of a 
transistor can be calculated at any given frequency. From these 
the 2nd harmonic distortion can be calculated for a particular 
value of Re. The natural equivalent circuit shown in Fig. B 
could be used for this purpose.“ It is claimed that in a medium 


* Meyer, N. I.: Monograph No. 209 R, November 1956 (see 104 C, p. 208). 
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(i) Output distortion factor of common-base configuration as a 
function of internal resistance of signal generator. 

(ii) Output distortion factor of common-emitter configuration as a— 
function of internal resistance of signal generator. 


Ry = 4:7kilohms; V, = 6 volts; I, = 3mA; Vow = 4°7 volts (d.a.p.); f = 100ke/s 
© Calculated. x Measured. 


Pape 


Za =ta+jwLla 


frequency range Lg and Cy can be neglected® and the circuit 
then contains seven parameters, which must be measured in 
order to calculate the h-parameters. 

For the common-emitter configuration, the A-parameters | 
deduced from the natural equivalent circuit will be 


1 


Ky =f. 
MO arias 


©) 


hg, = : 
a (8 “VOC. )ra 


c= 8’ + joc, 
12" K(ge + jwC,) 


e 1 ° 1 , j , 
hin = # (ee aCe) 4 eee ee | 
d 


‘gq (2 as jwC,)r 
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For the common-base configuration they will be 


rat(ge + jwC)[K (ge + joe) + roy] + K(ge + joCe rope} 


= 
MS Klee + joe) ira + Ge +JaC)| + ronda +82 + JC, + by + fae) oe 
he — [ra an (gy + jwC,)] lbb’ ae (g. + jwC,) [K(g. + jwCy) ae rpp’] (H) 
a, [ra te (g¢’ + jwCy) |rpp’ + (g¢ + joC,)[K(g. + jwCy) ar rpp’ | a Krq(ge + jwCy) 
he — rep (Se + jwC,) Ale (g.’ + jwCy) ote ra] ats (g¢ + jwCy) (ge + jwC,) ) 
A Yop [ (Ser + joCy) te (ge + jwC,) a ra] Be K(g- + jac. + jwC, a3 rg) 
a 8e jwCe oe ja Cy 
BS ra + (ee + jwC.) + (ge + jo.) (K) 


The seven necessary measurements are rpy, 2., Ce, h%29 
{Lf value of hf), ASzo (Lf. value of AS,), A$; (1.f. value of h§,) 
and w, (angular frequency at which |h¢ >| is 3dB higher than 
h,5). 

From these measurements all the remaining four factors that 
appear in expressions (C)-(K) can be calculated as follows: 


8c! = KNinoSe 


Ce 


We 

‘Experimental verification for the common-emitter configura- 
tions was made using the same junction transistor, type 3X/302, 
as above. 

The results of measurements and calculations are shown in 
Fig. C. The agreement obtained between measured and cal- 
culated values falls off above 10kc/s: this is thought to be due 
to neglecting Lz and Cy in the equivalent circuit used to calculate 
the h-parameters. 
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= K(g, + joc.) (rg ae + jwC,) cf rpp'(&e! + jwC,’) 


Dr. N. I. Meyer (in reply): I would like to present a short 
résumé of our latest results in this field. 

In September, 1957, an extensive theoretical and experimental 
investigation was started in our laboratory with the purpose of 
finding values for the 2nd and 3rd harmonic distortions in 
junction transistors as a function of d.c. conditions, frequency, 
load impedance, Z,, and the internal impedance, Z,, of the 
signal generator. The experimental investigation included both 
low- and high-frequency transistor types in the frequency range 
from l1kc/s to 100kc/s. The ranges of emitter current and 
collector voltage were 1 to 15SmA and —1 to —15 volts, while 
Z, and Z, were varied from 10 ohms to 10 kilohms and from 
50 ohms to 25 kilohms. These values refer to the common-base 
configuration. 

The results of the investigation are currently being published 
in a series of laboratory progress reports (in Danish). The first 
part, which includes a theoretical treatment of almost-linear, 
complex 2-poles and 4-poles, and extends the work described 
in the monograph to medium frequencies, is already in print. 

The theoretical and experimental results obtained show that 
the expressions found for low-frequency distortion can be applied 
in the same form at higher frequencies, provided that the complex 
parameters, corresponding to the proper frequencies, are inserted. 
These results apparently agree with the experimental conclusions 
reached by Miss Luz. 

The same progress report cites a few of the experimental 
results on 2nd and 3rd harmonic distortion in a low-frequency 
transistor of type OC76. The measured values agree well with 
theoretical results calculated from h-parameters that have been 
derived, as functions of d.c. conditions and frequency, directly 
from the physical design parameters of the transistor. 


2nd HARMONIC MARGIN DISTORTION, dB 


FREQUENCY, kc/s 


Fig. C.—Output distortion factor of common-emitter configuration as a function of frequency for different 
internal resistances of signal generator. 


Rz = 4:7 kilohms; V, = 6 volts; Ig = 3MA; Vow = 4°7 volts (d.a.p.). 


© Calculated. 


x Measured. 
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(i) 


Fig. D 


(i) Second-harmonic distortion factor, and 


(ii) Third-harmonic distortion factor as a function of emitter direct current for common-base configuration. 
Load resistance, R; = 5 kilohms. 
Source resistance, Ry = 100 ohms. 
Base-collector bias, V; = — 3 volts. 
Collector alternating current, ig = 


O xX Calculated. 


It is planned to publish an English translation of the report. 
Fig. D provides an example of the preliminary results. The 
2nd harmonic distortion factor, kz, is defined as the ratio 
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ponent of the collector alternating current. 


0:1mA (r.m.s.). 
Measured. 


of amplitudes of the 2nd harmonic and the fundamental com- 
An analogous 
definition is used for the 3rd harmonic distortion factor, k3. 


DISCUSSION ON 
‘THE SLIDING CONTACT OF GRAPHITE AND COPPER’* 


Dr. D. A. Bell (communicated): I should like the author’s 
views on the suggestion of Shobertt} that the voltage drop between 
brush and copper in steady running conditions is largely asso- 
ciated with the work function of an insulating film of liquid or 
gas which the current traverses by quantum-mechanical tunnel 
effect. The author states that coherer bridges are formed under 
the negative brush at a temperature approximating to the melting 
point of the metal, but Holm? states that melting of copper does 
not occur because the greater part of the voltage drop across the 
junction occurs in the carbon. Is there any way of observing 
directly the temperature of the contact areas on the copper? 

Dr. W. Davies (in reply): In reply to Dr. Bell, I must point 
out that the contact resistance does not seem to be noticeably 
affected by the ‘tunnel’ resistance of adsorbed films of liquid or 
gas—see, for example, my Fig. 14, where the ‘tunnel’ resistance 
of an adsorbed film of water alone is seen to be quite negligible. 
It is worth noting also that, in an experiment with dry nitrogen, 
an increase in the pressure of this gas from 0-1 to 760mmHg 
resulted in a decrease of about 20% in the kinetic friction, but 
produced no really measurable change in the contact resistance. 
The conclusion here is that a film of the gas was formed aero- 
dynamically under the brush and extended even into the actual 
contact areas; however, it must have been extremely thin in these 


* Davies, W.: Monograph No. 271 U, December, 1957 (see 105 C, p. 203). 

} SHoperT, EB. I.: ‘Electrical Resistance of Carbon Brushes on Copper Rings’, 
Transactions of the American I.E.E., 1954, 73, Part ILIA, p. 788, 

t How, E.: ‘Contribution to the Theory of the Contact between a Carbon Brush 
and a Copper Collector Ring’, Journal of Applied Physics, 1957, 28, p. 1171. 


latter regions, and conduction through it was almost certainly 
by way of the tunnel effect. 
In actual fact the influence of adsorbed water on the contact 


. 


tt Ri a ie 


resistance seems to be markedly dependent upon the partial 


pressure of ambient oxygen. It appears then that this water, 


which may be chemisorbed at the oxide surface, does not simply | 


form a superficial discrete film through which the current might 
be expected to pass by virtue of the tunnel effect. 

With regard to Dr. Bell’s second point, reference to my Fig. 4 
shows that for a current of 20amp in a negative brush the 


voltage drop in the contact is 0-15 volt in vacuum, and about — 


0:45 volt in dry oxygen at a pressure of 50mmHg. The oxide 
(the reoxidized coherer bridges) under these latter conditions 
accounts for a voltage drop of 0:3 volt. This value is of course 
somewhat less than that needed for the maintenance of molten 
bridges, but since it was measured by an ordinary voltmeter it 
seems to be distinctly possible that rather higher transient 
voltages of short duration were developed across the oxide film. 
In such circumstances the bridges when first formed would be in 
a molten state. 

The main point, however, is that these bridges are in an 
activated condition when reoxidation occurs. 

The difficulty of direct measurement of the temperature at the 
contact areas of a current-carrying sliding pair is clearly very 
great, and I am not aware of any really satisfactory method for 
making such measurements. 


‘THE STATISTICAL BASIS 


Dr. R. Hancox (communicated): In Section 8 Dr. Lewis has 
(discussed the accuracy to be expected in a limited series of 
iimpulse tests which take the form of Bernoulli trials. The 
<statistical variation of the number of breakdowns Sy obtained 
iin a series of N trials at a fixed voltage is shown to follow a 
{binomial distribution, having a maximum standard deviation 
\when the probability of breakdown p is equal to 0:5. From 
this it is concluded that the 50% breakdown criterion often 
specified in impulse tests is not necessarily the best which could 
'be adopted, since the estimate of the probability of breakdown 
 Sy/N is least accurate at the 50° breakdown point. It is often 
(of greater interest, however, to consider the accuracy with which 
ithe voltage, as opposed to the probability of breakdown, is 
(determined in such tests. This may be calculated if the distribu- 
{tion of the probability of breakdown p with voltage v is known, 
‘at ad the values of p and v are found for which it is a maximum. 
| Taking an approximate form for the distribution of Sy 
{feqn. (9)], it can be shown that the values of p which may result 
iin a given value of Sy/N = P being obtained in an experiment 
(cover the range 


Ap ~ 2A INPG = PI] 


Ia ” 


‘where f is a factor determined by the confidence limits required. 
]For any given distribution of the probability of breakdown this 
imay be rewritten as the range of voltages Av which could give 
(a particular experimental result. If Av has a minimum value 
‘when p = 0:5, the 50% breakdown criterion can be obtained 
imore accurately than any other from a statistical point of view. 
._ Alternatively, if Av is not a minimum when p = 0°5, it may be 
‘advantageous to choose a different criterion. The distribution 
(of p which gives the boundary condition, Av independent of 
‘voltage, may be found, and if Ap is small it is given by 

—= z9) 
V, 


Ss 


(B) 


v 
pQ= (1 + sin 7/2 


'when —V,< v — Vox VJ;. 


The variation of p with voltage which is most often used to 
‘represent the results of impulse breakdown tests is the normal 
‘or Gaussian distribution, and this can be shown to give a 
‘minimum value for Av when p = 0:5. Therefore, the 50% 
breakdown criterion can be assumed to give the greatest statistical 
‘accuracy unless it is known that the variation of the probability 
of breakdown with voltage differs appreciably from the normal 
distribution. 

Dr. T. J. Lewis (in reply): Dr. Hancox has made an important 
contribution concerning the accuracy of measurement of impulse 


* Lewis, T. J.: Monograph No. 249 M, July, 1957 (see 105 C, p. 27). 
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OF IMPULSE TESTING’* 


tests. As stated in Section 8 of the paper, for any given value 
of p there is a dispersion in the value of Sy which has a maximum 
when p = 0:5. Egn. (A) is a statement of what might be called 
the converse effect; namely, if a given value of Sy (=NP, say) is 
found in a test, there is a range Ap which could give this. Ap 
also involves a constant f which is related in a complicated way 
with a required minimum probability of obtaining S,. In fact 
Ap lies symmetrically about the point 


v2 
2P +5, 


044) 


If the minimum probability is high, fis small and 
p= P and Ap ~ 2f[P( — P)/N]! 


As might be expected A p has a maximum f//N when p = P = 0°5. 
Eqn. (A) may be illustrated by reference to Fig. 6 of the paper, 
in which, as an artificial example, if N = 10 and a minimum 
probability of about 1% is required for S,/N = 0-27, then p 
might range between about 0:1 and 0°S. 

Dr. Hancox has rightly stressed that in practice it will be the 
range of voltage Av rather than the range of probability Ap 
that will be more important. If Ap is small, i.e. f small, then 


are (ee 


Eqn. (B) gives the required form of p such that dv/dp changes to 
keep Av constant over the whole range of p. The more important 
problem, however, is given a particular form of p as a function 
of v, to determine the value of p for which Av is a minimum. 
It is clear from the behaviour of A p that, if dp/dv should change 
little over a range about p = 0:5, then Av will not be a minimum 
at that point. This has been stated less precisely in Section 8 
of the paper. 

It is true that if p is represented as a function of » by a normal 
distribution then Av is a minimum at p = 0°5, but it is also 
true that quite a small change in slope dp/dv could alter this 
fact. For example, if the slope of the p — v curve at p = 0°8 
were increased by 20° above that for a normal distribution 
function, then a minimum of Av would occur at p = 0°8 and 
not at p = 0-5. 

Since there is no a priori reason why a normal distribution 
function should be assumed for p, it is not necessarily correct to 
infer that the 50% breakdown criterion is the best. Some 
knowledge of the characteristic is required in any specific case 
before the minimum point can be found. Unfortunately, the 
determination of this very characteristic involves the same 
procedures and accuracies that are under discussion. 


[p= 
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